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PREFACE 


As  Commander  of  the  Health  Care  Studies  and  Clinical  Investigation  Activity 
(HCSCIA),  it  was  my  pleasure  to  host  the  very  first  Tri-Service  Performance 
Measurement  Conference  on  behalf  of  Major  General  Floyd  W. Baker,  Commander, 

US  Army  Health  Services  Command  (HSC),  and  Lieutenant  General  Bernhard  T. 
Mittemeyer,  Surgeon  General  of  the  Army.  We  at  HCSCIA  were  very  excited  at 
the  prospect  of  such  a  conference  and  continue  to  be  excited  by  the  results. 
These  Proceedings  from  the  Conference  should  prove  invaluable  for  all  of  us 
interested  in  improving  the  way  we  account  for  the  very  important  work  done 
in  delivering  health  care  to  members  of  tne  Uniform  Services  and  other  bene¬ 
ficiaries. 

Under  the  direction  of  Colonel  Gerald  D.  Allgood,  Chief  of  Staff,  Health 
Services  Command,  HCSCIA  was  given  the  ta.k  of  conducting  the  Army  Medical 
Department  Performance  Measurement  Study  (AMEDD  PMS).  This  study  is  currently 
examining  various  possibilities  for  replacing  the  Medical  Care  Composite  Unit 
(MCCU).  Very  early  in  this  project,  it  became  apparent  that  many  of  the  complex 
issues  surrounding  workload  measurement  and  productivity  analysis  are  not  unique 
to  any  one  military  medical  department.  In  addition,  any  contemplated  replace¬ 
ment  of  the  traditional  measure  of  work,  the  MCCU  or  Composite  Work  Unit  (CWU) 
as  it  is  known  in  the  Navy  and  the  Air  Force,  would  involve  all  three  Services 
and  the  Office  of  the  Assistant  Secretary  of  Defense  for  Health  Affairs 
(OASDCHA]).  General  Mittemeyer  was  convinced  that  Tri-Service  involvement  was 
not  only  desirable  but  very  necessary.  Thus,  the  need  for  this  Conference. 

The  very  excellent  agenda  of  noted  civilian  academic  speakers,  the  presen¬ 
tations  oy  members  of  the  Department  of  Defense  interested  in  the  general  topic 


of  performance  measurement,  and  the  spirit  of  cooperation  which  permeated  all 
activities  during  the  Conference,  resulted  in  an  exciting  and  productive  week. 
The  agreements  that  members  of  the  three  Services  would  continue  working  infor¬ 
mally  together  on  the  important  issues  discussed  provide  a  real  opportunity  to 
change  "business  as  usual"  in  area  of  performance  measurement,  resource  alloca¬ 
tion,  quality  of  care  monitoring,  and  interface  with  the  civilian  health  care 
delivery  community.  I  know  the  record  of  the  Conference  in  these  Proceedings 
will  serve  as  a  basis  for  future  research,  similar  future  meetings,  and  con¬ 
tinued  cooperation  and  progress. 

I  would  like  to  thank  all  participants  in  the  Conference,  especially  our 
civilian  guest  speakers.  Finally,  I  would  like  to  formally  thank  all  members  of 
HCSCIA  who  had  a  role  in  the  many  details  of  preparing  for  this  Conference  and 
publishing  these  Proceedings.  Their  hard  work  and  dedication  to  excellence  were 
in  large  part  responsible  for  the  success  of  the  Conference  as  reflected  in 
these  Proceedings.  These  personnel  include:  LTC  John  A.  Coventry,  LTC  Terry 
R.  Misener,  MAJ  David  V.  Wright,  MAJ  Michael  P.  Crutchfield,  CPT(P)  James  M. 
King,  CPT  Linda  P.  Sauer,  SFC  Timothy  M.  Canavan,  SSG  Robert  A.  Lamb,  Dr.  A.D. 
Mangelsdorf f ,  Mrs.  Pat  Twist,  Mrs.  Pat  Gilbert,  and  Mrs.  Louisa  Lohman. 


October  1984 


Fred  A.  Cecere 

Lieutenant  Colonel,  Medical  Corps 
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DRGs:  Background  and  Applications  in  Utilization 
Review  and  Hospital  Management 

Jean  L.  Freeman,  Ph.D.  and  Robert  L.  Mullin,  M.D. 

Since  the  implementation  of  Medicare's  prospective  payment  system,  the  three 
letters  DRG  have  often  provoked  rather  extreme  reactions  particularly  among 
physicians  and  hospitals  administrators.  Some  have  expressed  cautious  optimism 
that  the  system  will  have  at  least  some  impact  on  cutting  the  rising  costs  of 
hospital  care,  while  others  feel  DRGs  represent  a  hospital  acquired  disease, 
hooefully  of  limited  duration,  which  will  disappear  in  a  short  time.  In  fact, 
the  Diagnosis  Related  Groups  are  basically  a  classification  system  for  hospital 
discharges  with  much  broader  applications  for  utilization  review  and  hospital 
management . 

It  is  these  applications,  other  than  prospective  payment,  which  are  the 
focus  of  this  paper.  Prior  to  presenting  this  material,  however,  a  brief 
nistorial  background  on  the  development  of  the  DRGs  is  given.  The  major  point 
of  this  historical  perspective  is  that  group  definition  is  a  process  that  has 
evolved  over  the  past  10-15  years  in  response  to  updated  diagnostic  and  proce¬ 
dure  coding  and  to  comments  by  users  of  the  system  regarding  its  utility  as  a 
payment  system,  as  a  utilization  review  mechanism,  and  as  management  tool.  In 
the  future,  the  classification  will  continue  to  be  reviewed  and  revised  to 
accommodate  change  in  medical  practice  pattens,  advances  in  technology  and 
possibly  the  incorporation  of  other  factors  thought  to  affect  resource  use. 

H istorica 1  Background 

The  initial  development  of  a  patient  classification  system  at  Yale 
Jniveristy  began  in  the  late  1960s  and  was  largely  motivated  by  the  needs  of 
two  utilization  review  programs.  It  was  at  this  time  that  the  general  method¬ 
ology  was  established  and  an  approach  developed  that  was  to  be  used  in  all 


future  versions.  Briefly,  the  scheme  was  constructed  by  initially  dividing  all 
principal  diagnosis  codes  into  major  diagnostic  categories.  These  major  cate¬ 
gories  were  then  partitioned  into  subgroups  based  on  values  of  variables  asso¬ 
ciated  with  length  of  stay--the  only  utilization  measure  available  at  that  time. 
This  partitioning  process  was  done  by  evaluating  the  clinical  meaningfulness  of 
subgroups  formed  using  the  A.I.O.  algorithm.  A  special  statistical  system  was 
also  developved  at  this  time,  AUTOGRP,  to  support  an  interactive  version  of 
the  A. I .D.  algorithm. 

The  first  documented  version  of  the  "patient  groups"  as  they  were  referred 
to  then,  was  described  in  a  1973  working  paper  [1].  There  were  54  major  diag¬ 
nostic  categories  that  were  further  partitioned  into  333  final  groups  based  on 
values  for  the  variables  age,  presence  or  absence  of  specific  surgery  and 
secondary  diagnoses.  The  data  used  to  construct  the  scheme  consisted  of  dis¬ 
charges  from  selected  Connecticut  hospitals 

In  this  first  version  there  was  a  heavy  emphasis  placed  on  the  statistical 
attributes  of  the  groups  and  it  is  unclear  as  to  the  extent  of  physician  in¬ 
volvement.  The  scheme  was  used  to  examine  case  mix  differences  in  Connecticut 
hospitals  and  the  extent  to  which  differences  in  hospital  utilization  are  a 
function  of  differences  in  case  mix.  Another  working  paper  produced  in  1974  [2] 
suggested  further  applications  for  hospital  management  and  reimbursement. 
Research  was  started  around  this  time  in  the  development  of  a  cost  model  that 
would  enable  one  to  determine  the  costs  of  treating  patients  of  a  certain  case 
type. 

The  second  version,  produced  in  1977,  was  constructed  under  contract  with 
the  Socia1  Security  Administation  specifically  for  prospective  reimbursement. 

The  number  of  major  diagnostic  categories  was  expanded  to  83  and  the  number  of 
final  groups  to  383.  The  database  used  to  form  the  groups  contained  hospital 
discharges  from  New  Jersey,  South  Carolina,  and  a  local  New  Haven  hospital. 


Again,  group  definition  appeared  to  be  based  largely  on  statistical  criteria 
with  limited  physician  involvement.  This  was  the  first  version  to  receive  con¬ 
siderable  public  exposure,  largely  because  of  its  implementation  for  reimburse¬ 
ment  in  the  state  of  New  Jersey. 

This  increased  exposure  brought  with  it  a  number  of  criticisms  and  comments 
regarding  the  scheme's  weaknesses.  Much  of  the  criticism  was  constructive  and 
it  centered  around  the  way  the  groups  were  formed  based  on  surgical  procedures 
and  secondary  diagnoses.  Some  procedure  groups  included  any  surgical  procedure 
that  was  not  specifically  contained  in  other  groups,  often  resulting  in  a 
bizarre  mixture  of  patients  grouped  together  in  the  same  DRG  with  no  clinical 
interpretabi 1 ity.  Other  groups  were  formed  on  the  basis  of  the  presence  of  any 
listed  secondary  diagnosis  so  that  diabetes  as  a  comorbidity  was  given  the  same 
weight  as  a  hang  nai 1 . 

In  addition,  the  general i zabi 1 ity  of  the  findings  were  questioned  since 
the  data  were  for  the  most  part  limited  to  hospital  discharges  from  two  states. 
Other  criticisms  seemed  to  arise  out  of  a  general  confusion  with  the  underlying 
conceptual  framework  for  the  DRGs  and  what  was  their  purpose.  Terms  were  used 
such  as  hospital  product,  and  never  really  defined.  Certain  constraints  were 
implicitly  adopted  in  the  formation  of  the  patient  classes,  but  never  actually 
documented  in  print.  What  was  written  about  the  research  was  largely  contained 
in  working  papers  and  appendices  to  contract  reports.  So,  there  was  clearly  a 
need  to  document  more  completely  the  entire  "grouping"  process. 

The  research  group  responded  to  the  documentations  problems  by  publishing  a 
supplement  in  a  major  health  services  journal  -  Medical  Care  -  in  1980  [3]. 

Many  of  the  problems  with  the  surgical  procedures  groups  were  addressed  in  a 
third  version  of  the  DRGs  generated  in  1978  for  the  state  of  New  Jersey. 

However,  a  major  revision  of  the  system  began  in  1980  and  resulted  in  the 
fourth  version  of  the  DRGs  which  was  released  in  1982.  Since  then,  only  a  few 


modifications  have  been  made  to  selected  DRGs  to  correct  for  errors  made  in  the 
placement  of  certain  diagnosis  and  procedure  codes.  The  current  version  con¬ 
taining  these  updates  was  distributed  by  Health  Systems  International,  a  pri¬ 
vate  consulting  firm,  in  1983.  This  version  represents  a  significant  improve¬ 
ment  over  the  earlier  versions  largely  because  of  its  increased  "clinical  co¬ 
herence,"  attributed  to  the  physicians  who  consulted  on  the  project  as  well  as 
to  physicians  from  the  various  specialty  groups  who  approved  the  final  version. 

Overview  of  the  DRGs* 

Diagnosis  Related  Groups  (DRGs)  methodology  is  one  approach  to  measuring 
case  mix.  The  current  version  was  developed  by  the  Health  Systems  Management 
Group  of  the  Yale  University  School  of  Organization  and  Management  under  a  grant 
from  the  Health  Care  Financing  Administration  of  the  Department  of  Health  and 
Human  Services.  [4]  The  purpose  was  to  develop  an  inpatient  classification 
system  that  differentiates  the  amount  of  hospital  resources  required  to  provide 
care. 

Data  bases  used  were: 

1.  A  Commission  on  Professional  and  Hospital  Activity  (CPHA)  base  of  1.4 
million  records  representing  each  region  of  the  country,  teaching  and 
non-teaching  hospitals,  and  representative  bed  size  and  containing  the 
Uniform  Hospital  Discharge  Data  Set  elements  (UHDDS).  This  data  set 
contains,  among  other  things,  patient  age,  sex,  diagnoses,  surgical 
procedures,  and  disposition. 

2.  A  New  Jersey  State  Department  of  Health  financial  database. 

The  International  Classification  of  Diseases,  9th  Revision,  Clinical  Modi¬ 
fication  (ICD-9-CM)  is  the  coding  source  used  by  all  hospitals  and  was  the 
basis  of  the  groupings.  The  diagnoses  in  ICD-9-CM  were  divided  into  twenty- 
three  (23)  Major  Diagnostic  Categories  (MDCs)  corresponding  to  the  various  body 
organ  systems  and  medical  specialties  (Figure  1).  The  data  base  was  then  used 

*  This  material  appears  in  an  article  published  by  Dr.  Mullin  (Mullin  RL: 

DRG's:  A  brief  description.  Connecticut  Medicine  47:  281-282,  1983.)  and 

is  republished  here  with  the  permission  of  Connecticut  Medicine. 
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Figure  1 

LISTING  OF  TWENTY-THREE  MAJOR  DIAGNOSTIC  CATEGORIES 

Major  Diagnostic  Major  Diagnostic  Category 

Cateogry  Number  English  Description 


01 

02 

03 

04 

05 

06 

07 

08 

09 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 


Diseases  and  Disorders  of  the  Nervous 
System 

Diseases  and  Disorders  of  the  Eye 
Diseases  and  Disorders  of  the  Ear,  Nose 
and  Throat 

Diseases  and  Disorders  of  the  Respiratory 
System 

Diseases  and  Disorders  of  the  Circulatory 
System 

Diseases  and  Disorders  of  the  Digestive 
System 

Disease  and  Disorders  of  the  Hepatobiliary 
System  and  Pancreas 

Disease  and  Disorders  of  the  Musculoskeletal 
System  and  Connective  Tissue 
Disease  and  Disorders  of  the  Skin,  Subcutaneous 
Tissue  and  Breast 

Endocrine,  Nutritional,  and  Metabolic 
Diseases  and  Disorders 
Diseases  and  Disorders  of  the  Kidney  and 
Urinary  Tract 

Diseases  and  Disorders  of  the  Male 
Reproductive  System 
Diseases  and  Disorders  of  the  Female 
Reproductive  System 

Pregnancy,  Childbirth  and  the  Puerperium 
Newborns  and  Other  Neonates  with  Conditions 
Originating  in  the  Perinatal  Period 
Diseases  and  Disorders  of  the  Blood  and 
Blood-forming  Organs  and  Immunological 
Disorders 

Myeloproliferative  Diseases  and  Disorders 
and  Poorly  Differentiated  Neoplasms 
Infectious  and  Parasitic  Diseases  (Systemic  or 
Unspecified  Sites) 

Mental  Diseases  and  Disorders 
Substance  Use  and  Substance  Induced  Organic 
Mental  Disorders 

Injuries,  Poisonings  and  Toxic  Effects  of 
Drugs 
Burns 

Factors  Influencing  Health  Status  and 
Contacts  with  Health  Services 
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to  develop  statistically  and  clinically  coherent  groups.  The  principal  diagno¬ 
sis  (the  diagnosis,  after  admission  and  investigation,  which  is  the  principal 
reason  for  the  admission  to  the  hospital)  places  the  patient  in  an  MDC.  The 
first  partition  in  most  MDCs  is  the  presence  or  absence  of  a  surgical  procedure 
most  likely  performed  in  an  operating  room. 

The  operating  room  procedures  in  each  MDC  were  arranged  in  a  hierarchy  from 
most  resource  intense  to  least  resource  intense.  Those  patients  not  having  an 
operation  were  likewise  grouped  in  broad  categories  of  illness  from  the  most 
resource  intense  to  the  least  resource  intense.  Some  of  these  first  groupings 
(medical  and  surgical)  were  further  subdivided  on  the  presence  or  absence  of 
malignancy,  substantial  comorbidity  or  substantial  complication.  Age  was  also  a 
factor  in  some  partitionings. 

Age  seventy  (70)  was  found  to  be  the  significant  point  for  the  older  groups 
and  age  seventeen  (17)  for  the  younger  groups.  An  interesting  combination  of 
age  and  presence  of  a  complication  or  comorbidity  proved  to  be  a  powerful  deter¬ 
mination  of  resource  consumption.  Patients  with  a  substantial  comorbidity  or 
complication  fell  in  the  one  group,  while  all  patients  under  age  seventy  (70) 
without  a  substantial  comorbidity  or  complication  made  up  the  other  group. 

The  groupings  were  accomplished  by  a  computer  program  which  developed  dis¬ 
tinct  groups  based  on  the  greatest  reduction  of  variance  in  length  of  stay. 

These  initial  groups  were  then  reviewed  by  the  clinicians  and  reassignments 
made  for  clinical  coherence. 

There  are  467  final  groups  or  DRGs.  Patients  are  assigned  to  a  DRG  by  a 
computer  program  (qrouper)  which  looks  at  the  principal  diagnosis  (for  MDC 
assignment),  secondary  diagnoses  (for  presence  of  substantial  comorbidities  or 
complications),  surgical  procedures  (for  presence  of  an  operating  room  procedure 
and  placement  in  the  surgical  hierarchy),  age,  sex,  and  discharge  status.  Thus, 
tne  entire  DRG  assignment  process  requires  only  readily  available  information 


and  not  record  reviews  or  subjective  evaluations.  Accurate  recording  of  the 
above  elements  is  absolutely  essential  to  the  assignment  of  the  correct  DRG. 

Trim  points  (low  and  high  lengths  of  stay)  were  developed  for  each  DRG. 
Patients  falling  outside  of  those  points  are  called  outliers.  This  permits 
separate  consideration  for  the  small  group  of  truly  different  patients  that 
cannot  be  neatly  categorized  by  a  specific  DRG. 

Utilization  Review  Based  on  Practitioner  Profiles  * 

Utilization  review  for  Medicare,  Medicaid,  and  maternal  and  child  health 
programs  was  required  by  PL  92-603  (1972).  The  Joint  Commission  on  Accredita¬ 
tion  of  Hospitals  (JCAH)  has,  for  many  years,  required  a  utilization  review 
program  to  "address  over-utilization,  under  utilization,  and  inefficient 
scheduling  of  resources."  [5] 

It  became  increasingly  apparent  that  the  system  of  concurrent  utilization 
review  consisting  of  admission  certification  and  continuing  stay  review  for 
all  patients  was  neither  cost-effective  nor  efficient.  The  JCAH  Standard 
specified:  "Concurrent  review  shall  be  focused  on  those  diagnoses,  problems, 
procedures  and/or  practitioners  with  identified  or  suspected  utilization- 
related  problems."  The  establishment  of  some  type  of  focused  review  was  soon 
required  for  federal  programs. 

Efforts  to  date  have  centered  mostly  on  patient  categories,  either  by 
diagnosis,  service,  patient  classification,  or  some  other  patient  profile. 
Since  only  physicians  and  dentists  can  admit  and  discharge  patients,  and  are 
thus  responsible  for  the  patient's  length  of  stay,  it  seemed  logical  to  use 
practitioner  profiles  to  establish  a  focused  review  system. 

*  This  material  is  based  on  a  paper  presented  at  the  16th  Hawaii  International 
conference  on  System  Sciences,  Honolulu,  Hawaii,  January  1983  and  published  in 
the  Journal  of  Medical  Systems  7:  409-412,  1983).  It  is  republished  here 

with  permission  of  HICSS  and  the  Journal  of  Medical  Systems. 
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The  profile  developed  shows  the  actual  versus  the  expected  percentage  of 
a  practitioner's  patients  discharged  at  the  50th,  75th,  and  90tn  length-of- 
stay  (LOS)  percentiles  of  the  comparison  norms. 

In  late  1979,  a  profile  was  established  for  each  admitting  practitioner  on 
the  Yale-New  Haven  Hospital  medical  staff.  The  comparison  norms  (numerical  or 
statistical  measures  of  usual  observed  performance)  were  obtained  using  the  data 
from  the  Conference  of  Boston  Teaching  Hospitals.  Standards  (range  of  accep¬ 
table  variations  from  the  norms)  were  developed  by  the  hospital's  Patient  Care 
Evaluation  Committee. 

All  patients  of  practitioners  whose  profiles  did  not  meet  these  standards 
or  who  admitted  fewer  than  10  patients  per  year  were  subject  to  review. 

The  actual  review  is  performed  by  using  criteria  for  acute  hospital  stay 
developed  by  the  Patient  Care  Evaluation  Committee.  The  patients  are  reviewed 
within  48  hours  of  admission  and  then  every  3  days. 

A  steady  decline  in  the  LOS  of  focused  patients  was  noted;  however,  there 
was  little  effect  on  the  overall  length  of  stay  of  all  patients.  The  profiles 
were  regenerated  in  January  1981  using  more  up-to-date  data  from  32  primary 
university  affiliated  teaching  hospitals  throughout  the  country  to  establish 
norms.  The  drop  in  LOS  focused  patients  continued,  but  the  overall  LOS  still 
remained  virtually  the  same. 

In  October  1981  the  Patient  Care  Evaluation  Committee  tightened  the  stan¬ 
dards,  and  the  LOS  of  focused  patients  continued  to  decrease  and,  for  the  first 
time,  the  overall  LOS  showed  a  significant  decline. 

A  computer  program  was  developed  to  generate  the  profiles  illustrated  in 
Table  1.  The  diagnosis-related  groups  (DRGs)  were  used  to  adjust  for  individual 
case  mix  in  order  to  compare  practitioners. 


PRACTITIONER  PROFILES 


PHYSICIAN 

TOTAL  PATIENTS 

50th 

PERCENTILE 

75th 

90th 

A 

107 

Actual 

56.07% 

81.31% 

92.52% 

Expected 

53.74% 

76.60% 

90.59% 

Difference 

-2.33% 

-4.71% 

-1.93% 

B 

183 

Actual 

43.72% 

65.03% 

81.42% 

Expected 

54.39% 

76.96% 

90.61% 

Difference 

10.67% 

11.93% 

9.19% 

The  Actual  Percentage  is  the  composite  percentage  of  patients  discharged 
for  the  practitioner's  entire  case  load.  It  is  computed  as  follows: 

1.  Multiply  the  number  of  the  practitioner's  discharged  patients  in  each 
DRG  by  the  percentage  of  those  patients  discharged  on  or  before  the 
specific  LOS  percentile. 

2.  Sum  the  products  over  all  ORGs  with  practitioner's  case  load. 

3.  Divide  the  sum  by  the  total  number  of  the  practitioner's  patients. 

The  Expected  Percentage  is  the  percentage  of  th  practitioner's  patients 

that  would  have  been  discharged  at  the  specific  LOS  percentiles  if  the  LOS  of 
those  patients  in  each  DRG  was  identical  to  the  LOS  of  the  comparison  group's 
patients  in  the  same  DRG.  It  is  computed  as  follows: 

1.  Multiply  the  number  of  the  practitioner's  discharged  patients  in  each 
DRG  by  the  percentage  of  all  patients  in  the  comparison  group  in  that 
DRG  discharged  on  or  before  the  specific  in  LOS  percentile. 

2.  Sum  the  products  over  all  DRGs  in  the  practitioner 's  case  load. 

3.  Divide  the  sum  by  the  total  number  of  patients  in  the  practitioner’s 
case  load. 
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Total  patients  is  the  total  number  of  discharges.  Percentiles  are  the  length- 
of-stay  percentiles  in  whole  days. 

In  this  example  Practitioner  A  Discharged  107  patients.  The  percentage  of 
his  patients  discharged  at  each  LOS  exceeded  the  expected  values.  His  patients 
are  not  reviewed.  Practitioner  B  discharged  183  patients.  Ihe  percentage  of 
his  patients  discharged  at  each  percentile  was  less  than  the  expected  values  by 
more  than  the  allowed  amount  (+5%).  His  patients  are  subject  to  review. 

The  profiles  are  regenerating  approximately  every  6  months,  thus  eliminating 
from  review  those  practitioners  who  have  modified  their  practices  and  fall 
within  the  standards,  and  adding  those  practitioners  whose  practices  have  fallen 
below  the  standards. 

The  original  standards  we,  e:  Actual  Percentage  not  greater  than  the 
Expected  Percentage  by  more  than  +10%  at  the  50%  percentile,  +10%  at  the  75th 
percentile,  and  +5%  at  the  90th  percentiles.  The  revised  standards  established 
in  October  1981  were:  Actual  Percentage  not  greater  than  +5%  at  the  50th,  75th, 
and  90th  pecentiles.  Current  standards  established  in  October  1982  are:  Actual 
Percentage  not  greater  than  +4%  at  the  50th,  75th,  and  90th  percentiles. 

Table  2  shows  the  experience  to  date  of  the  system.  It  is  readily  apparent 
that  the  original  standards  (+10%,  +10%,  +5%),  while  dramatically  reducing  the 
LOS  of  the  focused  patients,  had  no  effect  on  the  overall  LOS. 

A  significant  effect  on  total  LOS  is  seen  for  FY  1982,  coinciding  with  the 
tighter  standards  starting  in  October  1981.  This,  of  course,  is  the  result  of 
reviewing  more  patients. 

We  have  attempted  to  develop  a  rational  system  of  focused  review  of  bed 
utilization.  The  diagnosis  related  groups  (DRGs)  are  the  basis  for  case  mix 
adjustment,  and  specific  computer  programs  have  been  developed  to  compare  each 
pract i t ioner ' s  performance  against  established  norms.  The  system  has  been 
extremely  successful  in  directing  review  to  the  correct  areas. 
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Table  2 

LENGTH-OF  STAY  RESULTS 

All  Patients 
Year-To-Date 


Focus 

Nonfocus 

Total 

FY 

1980 

12.16 

5.83 

7.76 

FY 

1981 

10.50 

6.45 

7.76 

FY 

1982 

9.51 

6.40 

7.61 

*  Standard  revised  October  1981. 

Although  the  overall  effect,  i.e.,  a  .15-day  drop  in  average  LOS,  may  not 
appear  to  be  substantial,  this  represents  approximately  5,400  patients  days. 

Management  Applications 

From  a  management  perspective,  the  DRGs  were  designed  to  form  operational 
definitions  of  the  products  of  a  hospital.  Specifically,  the  production  process 
is  described  in  Figure  2.  The  hospital's  outputs  are  the  specific  goods  and 
services  it  provides  to  patients.  These  include  the  X-rays,  medications,  and 
lab  tests  ordered  by  physicians  as  well  as  nursing  care,  and  certain  hotel 
and  social  services.  The  special  set  of  services  ordered  by  a  physician  for  a 
given  patient  is  defined  as  a  "product  of  the  hospital." 

The  DRGs  were  constructed  to  identify  types  of  patients  who  are  expected  to 
receive  a  similar  set  of  services  by  any  given  physician.  Several  examples  of 
tne  more  common  DRGs  are  listed  in  Figure  2.  Various  department  heads  in  the 
hospital  are  responsible  for  the  conversion  of  inputs  to  outputs.  The  physi¬ 
cian,  however,  is  responsible  for  the  "bundling"  or  "packaging"  of  outputs  which 
are  ultimately  provided  to  the  patient. 
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Figure  2 

DEFINING  THE  PRODUCT  OF  THE  HOSPITAL 


HOSPITAL 

PHYSICIAN 

OPERATIONS 

ORDERS 

INPUTS 

OUTPUTS 

PRODUCTS 

Labor 

X-rays 

Vaginal  Delivery  w/o 

Materials 

Medications 

Complicating  Diagnoses 

Equipment 

Lab  Tests 

Management 

Nursing  Care 

Appendectomy  w/o 

Meal  s 

Complicated  Principal 
Diagnosis,  Age  <70  w/o  CC 

★ 

Lens  Procedures 

★ 

★ 

★ 

* 

Under  Medicare's  prospective  payment  system  (PPS),  hospitals  are  paid  a  spe¬ 
cific  amount  for  each  patients  stay  based  on  a  DRG  specific  rate.  If  the  hospi¬ 
tal's  cost  are  less  than  the  rate,  it  may  keep  the  difference  but  if  its  costs 
are  higher  than  the  rate  the  excess  must  be  absorbed  by  the  hospital. 

Clearly,  it  is  critical  that  an  institution  have  a  detailed  knowledge  of  its 
costs  for  treating  specific  types  of  patients.  For  the  past  ten  years,  research 
has  been  conducted  by  various  members  of  the  Health  Systems  Management  Group 
into  methods  of  cost  finding  by  DRGs  and  the  general  approach  that  has  been 
developed  is  referred  to  as  "case  mix  accounting."  Case  mix  accounting  has  been 
extensively  documented  elsewhere.  [7,  8]  Presented  here  is  only  an  overview. 

The  reader  is  referred  to  the  articles  for  more  details.  (See  Appendix  E) 

Figure  3  (refer  to  [7])  summarizes  the  process  of  determining  the  costs  of 
treating  patients  in  each  of  the  DRGs  for  a  given  hospital.  The  ultimate 
objective  of  the  process  is  to  allocate  all  hospital  costs  on  per  patient 
basis.  Initially  a  hospital's  chart  of  accounts  is  divided  inti,  six  areas:  1) 


OVERVIEW  OF  THE  METHODS  Chart  Of  Accounts 

FOR  DETERMINING  THE  COST  OF  ~l~~ 

TREATING  PATIENTS  IN  EACH  ORG  I  Accounts 
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nonpatient  care  related  overhead  accounts  (such  as  depreciation),  2)  outpatient 
accounts,  3)  patient  care  related  overhead  accounts  (such  as  laundry,  house¬ 
keeping),  4)  direct  inpatient  hotel  and  other  general  services  (such  as  medical 
records,  billing),  5)  nursing  accounts,  and  6)  ancillary  accounts  (such  as 
laboratory,  radiology).  What  are  referred  to  as  final  cost  centers  are  defined 
from  the  latter  three  categories  involving  direct  patient  care  related  costs. 

For  these  final  cost  centers  resource  consumption  statistics  are  developed 
which  allow  us  to  allocate  costs  to  patients  in  each  DRG. 

Prior  to  this  step,  patient  care  related  overhead  costs  are  allocated  to  the 
final  cost  centers.  This  initial  allocation  process  for  the  overhead  accounts  is 
similar  to  the  standard  hospital  accounting  step  down  procedure.  For  example, 
housekeeping  is  allocated  on  the  basis  of  hours  worked,  laundry  and  maintenance 
on  the  basis  of  square  feet. 

Once  this  is  done,  the  direct  costs  of  each  final  cost  center  and  the  allo¬ 
cated  overhead  costs  represent  the  total  cost  of  providing  the  serivce  associated 
within  each  cost  center.  As  mentioned  earlier,  this  total  cost  is  then  allo¬ 
cated  to  individual  patients  on  the  basis  of  some  resource  consumption  statistic 
such  as  relative  value  units  or  charges  for  the  ancillary  cost  centers,  patients 
days  weighted  by  DRG,  for  nursing. 

Once  costs  have  been  allocated  to  patients,  managment  analyses  and  reports 
can  be  generated  which  give  a  summary  and  detailed  breakdown  by  each  DRG  by  the 
various  cost  centers.  Consider,  for  example.  Tables  2  and  3  in  Thompson  et.  al . , 
[7].  The  tables  present  a  summary  of  selected  utilization  and  cost  statistics 
for  a  given  DRG.  Note  that  69  patients  were  treated  for  a  total  cost  $271,0 37.87 
or  a  per  patient  cost  of  $3,928.00.  A  detailed  breakdown  such  as  that  appearing 
in  Table  3  gives  the  costs  arising  from  each  final  cost  center.  Cost  comparisons 
may  also  be  made  from  year  to  year  as  in  Table  2. 


In  addition  to  cost  finding  in  a  given  year,  this  approach  may  be  used  for 
hospital  budgeting.  That  is,  expenses  may  be  predicted  using  historical  cost 
information  and  patient  volume  predictions. 
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THE  HCFA  EXPERIENCE  ANO  WORK  MEASUREMENT  FOR  THE  MILITARY 


Stephen  Jencks,  M.D. 


A.  INTRODUCTION 

This  paper  is  about  three  aspects  of  the  HCFA  experience  with  DRGs 
which  may  be  useful  to  the  Department  of  Defense  as  it  considers  whether  to 
revise  or  replace  its  workload  measurement  formula: 

1.  The  present  and  future  status  of  the  DRG-based  system  which  HCFA 
operates  for  Medicare. 

2.  The  way  in  which  HCFA  calculates  weights  and  the  usefulness  and 
limitations  of  these  weights  in  the  DoD  system. 

3.  The  problems  of  variations  in  case  mix  which  are  not  measured  by 
the  DRG  system  and  the  implications  of  this  unmeasured  variation 
for  DoD. 

To  start  this  discussion,  however,  some  differences  and  similarities  in 
perspective  are  important.  First,  HCFA  is  interested  in  reimbursement,  which 
means  that  it  is  interested  in  the  amount  of  money  necessary  to  care  for  a 
patient.  Although  DoD  is  interested  in  this  issue,  it  is  also  interested  in 
certain  issues  which  do  not  interest  HCFA.  For  example,  DoD  must  decide  the 
manning  and  staffing  of  medical  activities.  Some  missions  of  DoD  medical  activ¬ 
ities,  such  as  readiness,  may  get  underplayed  or  unfunded  if  comparison  of  HCFA 
and  DoD  costs  is  overemphasized.  Although  I  am  aware  of  these  issues,  I  have, 
in  general,  not  addressed  them. 

But  HCFA  shares  with  DoD  a  feeling  that  some  medical  activities  use  their 
resources  more  efficiently  than  others  and  that  knowing  which  are  which  would  be 
most  useful  for  both  management  and  resource  allocation.  We  have  in  common  a 
belief  that  not  all  care  costs  are  necessary  and  that  a  formula  for  allocating 
resources  invites  very  useful  comparisons  and  adjustment  of  practices  if  the 
formula  is  roughly  fair  (and  abuse  if  it  is  not).  And  we  both  are  now  dividing 
a  finite  total  pot,  not  drawing  from  a  bottomless  well.  The  question  for  DoD  is 
whether  a  formula  using  DRGs  could  be  more  fair  than  the  current  formula. 
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B •  M  PRESENT  AND  FUTURE  OF  PPS 

I  want  to  make  very  clear  that  what  I  am  about  to  say  does  not  represent  an 
official  HCFA  position,  not  because  I  disagree  with  the  HCFA  position  but 
because  HCFA  does  not  have  an  official  position  on  many  of  the  issues  I  will  be 
discussing.  What  follows  is  only  the  best  judgment  of  an  informed  observer, 
which  I  believe  to  be  consistent  with  HCFA  policy.  I  have  framed  my  thoughts 
around  five  questions: 


PPS  is  the  entire  system  of  regulations,  weights,  and  conversion 
factors  which  makes  DRGs  a  way  to  pay  hospitals.  PPS  was  enacted  by  Congress, 
and  PPS  could  be  repealed  by  Congress  at  any  time.  One  must  be  especially 
cautious  because  PPS  went  from  idea  to  law  so  very  rapidly  by  Congressional 
standards  that  further  rapid  changes  must  not  be  ruled  out.  Nevertheless,  I 
think  that,  for  some  years  to  come,  changes  in  Medicare's  payment  system  will  be 
evolutionary  rather  than  revolutionary.  The  direction  of  evolution  will  be 
toward  more  comprehensive  payment  systems  leading  toward  capitation,  regardless 
of  the  administration  in  Washington.  I  think  we  will  move  toward  capitation 
because  so  many  people  either  like  the  idea  or  are  coming  to  like  it.  Those  who 
want  to  reduce  the  Federal  bureaucracy  like  a  system  which  permits  just  turning 
over  a  capitation;  fee  to  a  provider  chosen  by  the  Medicare  beneficiary;  those 
who  want  more  prevention  and  efficiency  like  the  flexibility  of  capitation,  some 
whom  we  might  once  have  labeled  liberals  like  the  concept  of  HMOs.  And  I  think 
that  organized  medicine  is  rapidly  finding  capitation  more  attractive  as  the 
very  aggressive  cost  control  intentions  of  both  Congress  and  the  Administration 
becomes  clear.  I  personally  doubt  that  there  will  be  another  revolution  com¬ 
parable  to  PPS  between  now  and  capitation,  although  I  expect  some  very 
interesting  innovations.  But  PPS  came  very  fast  and  something  else  cou.d  come 
equal ly  fast . 
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2 .  Are  DRGs  here  to  stay? 

I  think  that  DRGs  are  very  unlikely  to  be  displaced  by  one  of  the  com¬ 
peting  systems  such  as  Patient  Management  Pathways,  Staging,  Severity,  or 
MEDISGRPS.  On  the  other  hand,  I  think  it  is  very  likely  that  DRGs  will  be 
somewhat  modified  over  coming  years,  and  somewhat  likely  that  they  will  be  a 
great  deal  modified.  There  are  enough  areas  of  significant  recognized 
diff iculty--such  as  psychiatric,  rehabilitation,  and  pediatric  hospitals--and 
enough  specific  technical  questions  so  that  modifications  seem  inevitable.  And, 
I  would  add,  if  such  growth  were  not  possible  DRGs  would  constitute  a  serious 
disservice  to  the  delivery  of  health  care.  On  the  other  hand,  DRGs  have  some 
practical  advantages  over  the  competition  which  makes  more  likely  an  engrafting 
of  features  of  other  systems  onto  the  DRG  system. 

3.  What  research  is  in  progress? 

There  is  a  great  deal  of  research  in  the  area  of  prospective  payment, 
and  summarizing  it  would  take  a  long  time.  In  addition,  a  number  of  private 
sector  projects  are  underway.  But  I  think  there  are  several  areas  which  I 
should  highlight: 

a.  HCFA,  ASPE,  and  NCHSR  are  currently  supporting  work  on  a  number  of 
alternative  systems,  including  but  not  limited  to  Staging,  Severity  of  Illness, 
Patient  Management  Categories  and  APACHE.  In  addition,  we  are  watching  with 
interest  work  on  MEDISGRPS.  Other  systems  seem  likely  to  come  over  the  horizon. 

b.  HCFA  is  looking  for  ways  to  improve  DRGs  so  that  currently  uncovered 
hospitals--pediatric,  psychiatric,  and  rehabi 1 itation--can  be  covered. 

c.  HCFA  is  looking  at  ways  of  solving  a  number  of  more  specific 
problems  such  as  simultaneous  replacement  of  two  hips,  treatment  of  major  burns, 
different  types  of  pacemakers,  hospice-related  care,  and  the  differences  between 
alcohol  detoxification  and  rehabilitation. 
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d.  HCFA  is  beginning  to  look  at  the  impact  of  PPS  on  various  types  of 
hospitals  to  see  whether  there  are  groups  of  hospitals  which  fare 

well  or  poorly  under  PPS. 

e.  Under  orders  from  Congress,  HCFA  is  investigating  conducting  a 
number  of  other  investigations:  whether  physicians  should  be  reimbursed  for  in- 
hospital  care  under  DRGs,  whether  the  DRG  system  adequately  describes  variations 
in  case-mix,  whether  urban-rural  price  differences  should  be  preserved,  the  ade¬ 
quacy  of  outlier  payments,  the  impact  of  PPS  on  sole  community  hospitals,  the 
costs  of  uncompensated  care,  the  position  of  large  rural  teaching  hospitals,  the 
effect  of  providing  cost  of  care  information  to  patients,  the  recalibration  of 
weights,  the  impact  of  state  alternatives,  and  last,  but  by  no  means  least,  the 
actual  impact  of  PPS  on  hospitals. 

Some  of  these  issues,  such  as  the  costs  of  uncompensated  care,  appear 
to  be  of  limited  interest  to  DoD  while  others,  such  as  the  position  of  the  sole 
community  hospital,  with  attendant  costs  of  large  variation  in  occupancy,  appear 
to  have  a  direct  relevance  to  OoD's  problems. 

4 .  What  changes  in  DRGs  should  DoD  anticipate? 

a.  There  will  be  new  weights  for  DRGs  and  those  weights  will  continue 
to  change  in  the  future.  This  has  practical  implications  for  the  way  in  which 
DoD  might  choose  to  adopt  weights  from  the  civilian  sector. 

b.  The  DRG  algorithm  will  change,  even  though  it  might  not.  I  do  not 
think  that  you  would  be  wise  to  adopt  the  DRG  algorithm  without  deciding  to 
include  future  changes  as  part  of  your  adoption.  This  is,  or  ought  to  be,  a 
dynamic  system.  But  you  should  also  anticipate  that  these  changes  will,  at 
least  initially,  be  compatible  with  the  current  system  and  that  DoD,  like  HCFA's 
fiscal  intermediaries  and  carriers,  will  be  able  simply  to  mount  a  new  program 
on  a  computer  without  reorganizing  all  of  its  implementation. 
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c.  The  algorithm  will  continue  to  rely  on  the  Uniform  Hospital 
Discharge  Data  Set  in  the  near  term.  1  frankly  do  not  know  whether  there  will 
be  changes  further  down  stream,  but  am  willing  to  bet  that  changes  in  the  hasii 
data  set  will  occur  if  we  do  not  have  universal  capitation  by  1990. 

d.  Changes  to  make  DRGs  more  workable  for  psychiatric  and  rehabilita¬ 
tion  services,  two  current  deficiencies  should,  1  think,  be  of  special  concern 
to  DoO. 

e.  Changes  in  physician  reimbursement  by  HCFA,  although  it  is  unclear 
whether  physician  DRGs,  relative  value  scales,  or  some  other  option  will  be  cho¬ 
sen  . 

5.  What  is  HCFA's  approach  to  integrating  inpatient  and  outpatient 

services  for  work  measurement  or  payment? 

lo  date,  HCt A  has  not  been  able  to  obtain  satisfactory  approaches  to 
the  problem.  Two  strategies  are  currently  on  the  research  agenda: 

a.  For  care  surrounding  a  hospitalization,  we  are  seeking  to  deline 
service  windows  which  might  be  used  to  include  those  ambulatory  service's  as  part 
of  inpatient  care. 

b.  For  general  ambulatory  care,  we  have  systems  which  characterize 
individual  visits  but  no  system  which  adequately  character i zes  episodes  of  care. 
The  episode  of  care  seems  to  be  the  appropriate  unit  of  work  for  outpatient  care 
just  as  the  admission  is  the  unit  of  work  for  inpatient  care  under  DRGs.  But  it 
is  very  difficult  to  define  these  episodes  normat.  i  vel y  and  it  is  fairly  dif¬ 
ficult  to  define  them  from  a  medical  record.  Accordingly,  we  are  not  optimist  n 
that  a  solution  to  this  problem  is  within  the  state-of -the-art .  fhe  risk  one 
runs  is  that  if  recognition  of  workload  is  by  the  visit,  facilities  which  use  an 
unnecessary  number  of  visits  to  manage  their  patients  will  be  mistakenly 
recognized  as  efficient.;  thorp  is  even  a  risk  that  this  will  function  as  a  per¬ 
verse  incentive  driving  the  system  in  directions  iri  which  you  would  not  wish  it 
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to  go.  OoD  should  consider  adopting  a  procedure  code  compatible  with  HCFA  in 
order  to  more  reasonably  reflect  the  resource  cost  of  ambulatory  care. 

C.  WEIGHTS 

The  way  in  which  HCFA  has  derived  the  weights  for  DRGs  is  complicated  and 
fairly  specific.  In  essence  these  weights  are  based  on  HCFA's  determination  of 
costs  compared  to  an  average  admission,  but  to  understand  this  concept  you  need 
to  understand  a  number  of  features  of  the  system: 

1.  HCFA  calculates  costs  in  a  very  special  way  which  is  founded  as  much  in 
law  as  in  either  accounting  or  economics.  Certain  costs  are  allowable  and 
others  are  not--for  example,  teaching  costs  are  covered  in  a  special  way  rather 
than  as  part  of  overall  operating  costs,  bad  debt  is  excluded,  depreciation  is 
covered  outside  of  the  weights,  and  weights  are  synthesized  in  a  relatively 
complex  manner.  Nevertheless,  the  correlation  of  weights  derived  from  HCFA's 
"costs"  to  weights  derived  from  hospital  charges  is  about  .98  or  .99,  so  I  don't 
think  you  need  to  be  too  concerned  about  HCFA's  cost-finding  procedures.  There 
are  other  data  sets  to  which  you  would  want  to  go  for  information,  in  any  case, 
because  there  weren't  enough  cases  in  some  DRGs  in  HCFA's  experience  to  weight 
them  properly  and  because  HCFA  relied  primarily  on  a  data  set  which 
overwhelmingly  consists  of  the  elderly. 

2.  HCFA  trims  its  data  base. 

HCFA  has  a  system  for  paying  out! iers--that  is,  cases  which  stay  either 
much  longer  or  less  long  than  was  expected  from  the  DRG--which  takes  those  cases 
out  of  the  weighting  system.  The  reason  for  this  decision  is  that,  statisti¬ 
cally,  the  DRG  system  becomes  highly  unstable  if  outliers  are  included.  So 
HCFA  pays  for  outliers  separately.  DoD  would  need  either  to  treat  outliers  the 
same  way  or  to  compute  new  weights.  The  outlier  system  is  probably  a  prudent 
way  of  handling  one  of  the  shortcomings  of  the  DRG  system. 
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3.  HCFA  data  does  not  include  physician  charges.  Such  data  will  become 
available  in  the  next  year  or  two,  and  DoD  could  take  advantage  of  it,  but 
current  weights  will  not  include  physician  charges.  Further,  it  is  arguable 
that  there  are  great  differences  between  DoD  physician  costs  and  civilian  physi¬ 
cian  charges,  and  I  am  unsure  whether  we  could  find  data  to  approximate  DoD's 
cost  of  physician  services. 

4.  HCFA  converts  weights  to  costs  with  a  multiplier  which  considers  urban/ 
rural  and  regional  differences  in  costs  of  running  a  hospital.  DoD  would  pro¬ 
bably  need  to  use  a  rather  different  formula  because  labor  costs  vary  in  a  very 
different  way  in  civilian  and  military  settings,  and  DoD  might  wish  to  make 
overall  adjustments  because  of  its  cost  structure. 

5.  HCFA  does  not  recognize  administrative  days  or  any  equivalent  thereof 
as  part  of  a  DRG  weight.  This  is  an  important  policy  call  in  implementation  for 
DoD. 

6.  HCFA  has  a  system  for  paying  for  transfers  which  DoD  would  do  well  to 
examine  very  carefully  and  to  which  there  may  be  preferable  alternatives. 

7.  HCFA  uses  special  adjustments  for  teaching  costs.  Although  DoD  could 
adopt  these,  it  may  not  want  to:  surely  residents  are  the  individuals  whom  we 
should  be  most  able  to  monitor  and  to  teach  to  practice  efficient  medicine.  If 
we  can't  modify  their  behavior,  whose  behavior  can  we  modify? 

8.  HCFA  does  not  distinguish  between  categories  of  patients  but  DoD  might, 
for  example,  wish  to  apply  different  weights  for  active  duty  personnel.  An 
overall  special  weight,  applied  as  a  general  multiplier,  could  easily  be  calcu¬ 
lated  by  regression  techniques  using  the  hospital  as  the  unit  of  analysis,  per¬ 
cent  active  duty  and  the  hospital  case  mix  index  as  independent  variables,  and 
cost  per  admission  as  the  dependent  variable. 


D.  VARIATIONS  IN  CASE  MIX  UNMEASURED  BY  THE  DR 6  SYSTEM 

I  did  much  of  the  thinking  which  follows  as  I  was  trying  to  work  out  how 
HCFA  should  approach  the  problem  of  case  mix  variation.  Thus,  I  have  laid  out 
the  problems  in  terms  of  how  they  might  be  addressed,  rather  than  their  analytic 
relationship.  I  offer  no  promise  that  HCFA  will  do  the  research  envisioned,  but 
this  approach  may  give  you  my  ideas  as  to  how  solutions  might  develop.  In  no 
possible  sense  is  this  discussion  to  be  construed  as  the  HCFA  research  agenda-- 
it  is  one  person's  notion  of  how  such  research  might  be  approached. 

1.  The  Nature  Of  The  Problem:  Unmeasured  case  mix  variation  is  important 
because,  if  it  is  unevenly  distributed  across  hospitals,  it  results  in  inequi¬ 
table  judgments  as  to  hospital  performance.  Such  inequities  can  be  unhelpful 
and  could  result  in  a  system  which  is  even  more  perverse  than  Consolidated 
Workload  Measurement.  It  is  uncertain,  at  present,  how  serious  this  problem  may 
be  with  DRGs. 

a.  Necessary  Treatment  v.  Severity  and  Intensity:  To  frame  our 
approach,  we  should  first  recognize  that  while  many  critics  of  the  DRG  system 
focus  on  severity  of  illness  and  intensity  of  care,  unmeasured  "severity"  or 
intensity  of  care  may  not  be  the  most  important  issues  in  considering  the  ade¬ 
quacy  of  DRGs.  Both  HCFA  and  DoD  are  more  appropriately  concerned  about  the 
resources  necessary  to  treat  the  patient's  condition,  and  severity  of  illness 
and  intensity  of  care  are  only  clues  to  the  resources  required  for  necessary 
treatment.  Necessary  treatment  may  be  quite  different  from  severity  in  many 
cases,  such  as  terminal  cases  in  which  little  treatment  is  appropriate  and  early 
cases  in  which  intensive  treatment  may  prevent  complications.  The  distinction 
between  necessary  treatment  and  the  intensity  of  treatment  actually  rendered 
is  sometimes  difficult,  but  a  central  premise  of  PPS  is  that,  in  a  large  number 
of  cases,  less  treatment  is  necessary  than  has  actually  been  rendered  and  that 
economies  can  be  achieved  through  eliminating  surplus  treatment  activities  such 


as  prolonged  hospital  stay.  Therefore  we  should  also  be  careful  to  distinguish 
intensity  of  care  from  necessary  treatment.  This  distinction  does  not  require 
that  we  develop  norms  of  treatment  or  that  we  base  weights  on  some  hypothetical 
ideal  treatment,  but  we  must  keep  in  mind  that  PPS  was  introduced  on  the  assump¬ 
tion  that  a  significant  amount  of  treatment  not  rendered  is  not  necessary. 

In  thinking  about  these  problems  we  should  adhere  to  the  language  of 
treatment  requirements  and  consider  "severity"  and  intensity  of  care"  only  as 
approaches  which  may  prove  useful  in  measuring  treatment  requirements. 

b.  Hospitals  v.  Cases:  Although  case  mix  indices  seek  to  classify 
individual  cases,  no  proponent  of  an  index  has  suggested  that  any  index  can 
exactly  describe  each  individual  case.  The  operational  requirement  is  that  an 
index  accurately  describe  the  average  case  mix  of  each  hospital,  since  the 
hospital  is  the  unit  at  which  risk  is  pooled  and  to  which  performance  is 
assessed.  The  ultimate  test  of  a  case  mix  system,  whether  DRGs  or  a  modifica¬ 
tion,  is  fairness  to  hospitals. 

Our  criterion  in  thinking  about  case  mix  variation  should  be  the 
impact  on  hospitals  and  classes  of  hospitals  rather  than  the  characterization 
of  individual  cases. 

c.  A  Hierarchy  of  Solutions:  In  approaching  the  problem  of  unmeasured 
variation  in  requirements  for  treatment,  we  should  consider  a  heirarchy  of  solu¬ 
tions  which,  if  we  start  on  all  of  them  now,  can  be  available  at  various  times 
in  the  future. 

(1)  For  the  short  range  consider  new  ways  to  compute  weights  for 
DRGs,  ways  to  merge  DRGs  with  Staging  and  Patient  Management  Categories,  and 
refinements  based  on  previous  patient  treatment.  The  present  lack  of  com¬ 
puterization  in  Severity  of  Illness  precludes  merging  it  with  the  DRG  system  at 
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(2)  For  the  intermediate  range  consider  ways  to  test  variations  between 
hospital  types  and  determine  whether  hospital  type  weights  would  make  the  system 
more  equitable.  Such  solutions  should  be  available  by  1986-7. 

(3)  For  the  long  range,  consider  ways  to  create  types  of  case  mix 
systems  which  represent  a  full  generation  of  advance  over  the  present  DRGs. 

2.  Short-Range  Solutions:  There  are  three  kinds  of  short-range  solutions-- 
those  using  cleaner  data  or  better  procedures  to  weight  the  DRGs,  those  using 
methods  from  other  UHDDS-based  systems  to  improve  DRGs,  and  using  treatment 
history  to  improve  DRGs. 

a.  Weighting  procedures: 

(1)  The  data  used  for  creating  weights  should  be  as  clean  as 

possible. 


(a)  We  should  institute  a  diligent  search  for  the  most 
reliable  Medicare  data  sets.  This  particularly  means  using  PROS,  the  work  of 
former  PSROs,  PHDDS ,  and  intermediary  information  to  identify  those  inter¬ 
mediaries,  states,  and  regions  with  especially  low  error  rates.  We  should  also 
search  for  editing  procedures  which  may  improve  data  cleanliness. 

(b)  We  should  exclude  care  which  has  been  disallowed  by 
review.  This  may  imply  a  preference  for  data  from  regions  where  review  has  been 
especially  vigorous  or  effective. 

(2)  We  should  test  modifications  to  the  weighting  system  which 
would  counteract  known  or  suspected  biases.  For  example,  we  know  that  the  use 
of  flat  per  diems  understates  the  variation  in  nursing  costs.  We  also  know  that 
data  errors  are  likely  to  cause  weights  to  be  erroneously  close  to  the  mean. 

This  suggests  the  possibility  that  the  variance  of  DRG  weights  is  less  than  it 
should  be  and  that  the  entire  set  of  weights  should  be  rescaled  to  a  larger 
standard  deviation.  This  procedure  is  mathematically  easy  and  can  be  tested  by 
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determining  whether  the  calculated  hospital  reimbursements  more  closely  match 
historical  costs.  A  study  of  nursing  intensity  such  as  that  proposed  by 
Thompson  at  Yale  would  shed  further  light  on  this  issue. 

b .  Synthesize  DRGs  with  Staging  and  Patient  Management  Categories: 
Staging  and  Patient  Management  Categories  are  computerized  systems  which  operate 
on  the  current  Medicare  data  set.  Although  merging  the  methods  presents 
substantial  technical  problems,  work  on  such  a  synthesis  for  staging  of  a  few 
categories  is  already  proceeding  under  an  ASPE  contract.  A  broader  effort 
involving  both  Staging  and  Patient  Management  Categories  is  conceptually 
straightforward  and  should  go  forward  as  soon  as  the  1983  MEDPAR  file  is  in 
hand.  The  fundamental  technique  is  to  examine  areas  in  which  the  performance  of 
Staging  or  Patient  Management  Categories  appears  to  be  as  good  as  or  better  than 
that  of  DRGs  and  then  to  synthesize  systems. 

c.  Previous  Treatment  as  a  Patient  Variable:  An  important  piece  of 
information  about  patients--previous  inpatient  and  perhaps  outpatient  treatment¬ 
's  available  from  Medicare  files  and  from  DoD  files  but  has  not  been  used  for 
case  mix  analysis.  Previous  treatment  may  be  an  indicator  of  comorbidity  and 
the  severity  of  disease,  and  may  be  more  effective  than  the  comorbidity  indica¬ 
tors  used  in  the  current  Medicare  system.  The  Medicare  file  record  of  previous 
admissions  may  also  be  a  useful  tool  for  approaching  the  problem  of  "split 
admissions":  cases  in  which  medical  judgment  would  permit  managing  a  problem  on 
either  a  single  admission  or  on  two  admissions. 

We  should  be  aware,  while  seeking  to  refine  DRGs  by  these  methods, 
that  enlarging  the  number  of  DRGs  presents  no  theoretical  problems  so  long  as 
enough  data  ’s  available  to  calculate  suitable  weights.  The  case  mix  of  a 
hospital  will  generally  be  more  accurately  estimated  if  there  is  a  small  number 
of  cases  in  each  of  a  large  number  of  DRGs  than  if  there  is  a  large  number  of 
cases  in  each  of  a  small  number  of  DRGs. 


28 

3.  Intermediate  Range:  In  the  middle  range  we  should  be  able  to  complete 
studies  to  characterize  the  case  mix  of  hospitals  using  either  of  two 
strategies : 

a.  Overall  character i zation  using  multiple  methods:  This  approach 
seeks  to  characterize  the  case  mix  of  hospitals  using  MEDISGRPS,  Severity  of 
Illness,  and  other  methods  to  supplement  the  refined  DRGs  developed  in  the  short 
range  program.  Because  using  these  methods  requires  primary  data  collection,  it 
cannot  be  accomplished  in  the  near  term.  The  methods  can  assume  special  power 
if  they  are  used  to  cross-val i date  one  another.  The  goal  is  to  reach  more 
sophisticated  conclusions  than  are  now  possible  regarding  differences  in  hospi¬ 
tal  case  mix.  In  particular,  this  strategy  would  allow  us  to  determine  whether 
certain  categories  of  hospitals  such  as  teaching  hospitals  should  have  special 
weights. 

Using  multiple  methods  will  address  the  problem  that  different 
methods  have  different  weaknesses,  and,  specifically,  that  different  methods  are 
vulnerable  in  different  ways  to  confusing  the  care  which  was  necessary  with  the 
care  which  was  actually  rendered.  The  most  clinical  of  classification  systems 
(e.g.,  Staging,  MEDISGRP,  Patient  Management  Categories)  include  the  results  of 
whatever  diagnostic  efforts  have  been  made,  and  for  some  diagnoses  this  implies 
that  expenditures  have  already  been  made  on  diagnostic  activity.  More  often 
(e.g.,  DRGs),  the  system  also  describes  some  of  what  was  done  (e.g.,  procedures), 
and  thus  depends  on  an  external  system  of  review  to  determine  whether  such 
treatment  was  necessary.  Sometimes  (e.g..  Severity),  the  system  uses  the 
details  of  the  chart  in  a  relatively  impressionistic  way  and  it  becomes  dif¬ 
ficult  to  determine  whether  the  reviewer  is  responding  to  the  patient's  clinical 
condition  or  to  what  was  done.  Admitting  diagnosis,  which  can  illuminate 
reasons  for  differences  between  discharge  data  and  treatment  rendered,  may  well 

become  available  on  the  Medicare  data  set;  this  data  could  contribute  to  the 
cross-validation  analyses. 
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The  goal  of  this  combined  analysis  would  be  to  clarify  the  degree 
to  which  indices  sucn  as  Severity  of  Illness  characterized  the  care  patients 
needed  or  only  the  care  which  they  received.  With  this  validation  accomplished, 
it  would  be  possible  to  compare  different  hospitals  and  classes  of  hospitals 
with  an  order  of  magnitude  and  more  confidence  in  our  results  than  we  can  now 
achieve  with  DRGs. 

b.  A  second  approach  to  determining  differences  in  case  mix  is  to 
carefully  analyze  certain  tracer  conditions  to  determine  whether  hospitals  have 
different  costs  for  conditions  which  are  identical  when  fully  corrected  for 
intensity  of  illness  and  other  variables.  An  example  of  this  kind  of  study  is 
List's  1983  study  of  management  of  myocardial  infarction  in  Maryland  and  Oregon. 
Such  studies  can  shed  great  light  on  the  relative  efficiencies  of  different 
kinds  of  hospitals;  with  such  information  on  relative  efficiency,  relative  costs 
can  be  used  as  indicators  of  case  mix  differences. 

The  effect  of  either  of  these  strategies  would  be  to  permit  us  to 
differentiate  with  greatly  improved  accuracy  between  hospitals  which  are  winners 
and  losers  under  PPS  because  of  unmeasured  case  mix  variation  and  hospitals 
which  are  winners  and  losers  because  they  are  efficient  or  inefficient. 

Although  some  of  the  strategies  cannot  be  applied  directly  to  DoD  facilities, 
many  of  the  results  could  be. 

4.  Long-Term:  Long-term  strategies  rest  on  more  sophisticated  concepts 
of  case  mix.  These  strategies  assume  that  by  the  end  of  the  intermediate  phase 
we  will  have  a  system  of  case  mix  analysis  which  characterizes  hospitals  on  the 
basis  of  the  patient's  condition.  However,  factors  other  than  patien1,  condition 
influence  treatment  needs  and  should  be  measured  in  a  more  sophisticated  system: 
a.  j_he  purpose  of  treatment : 

(1)  A  patient  admitted  for  terminal  care  will  receive  very  dif¬ 
ferent  care  from  a  patient  whose  physician  is  determined  to  save  him.  A  recent 
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article  by  Garber  et  al  in  the  New  England  Journal  of  Medicine  observes  that  a 
significant  part  of  the  greater  costs  of  treatment  by  teaching  hospitals  may  be 
related  to  the  fact  that  in  a  non-teaching  setting  more  patients  with  a  high 
risk  of  dying  were  admitted  for  essentially  supportive  care.  It  is  interesting 
that,  although  the  teaching  patients  appeared  to  have  a  lower  in-hospital  mor¬ 
tality,  their  mortality  at  nine  months  follow-up  was  the  same,  suggesting  that 
the  teaching  facility  may  spend  much  more  effort  prolonging  the  inevitable. 

(2)  Treatment  goals  may  vary  widely  for  other  reasons.  For 
example,  two  patients  in  identical  condition  may  be  admitted  to  a  rehabilitation 
hospital  with  quite  different  rehabilitation  goals.  A  psychiatric  hospital  may 
set  very  different  goals  for  two  very  similar  schizophrenic  patients  depending 
on  the  degree  of  chronicity  and  response  to  previous  treatment.  An  oncology 
service  may  have  very  different  goals  for  a  specific  admission  of  patients  at 
the  same  disease  stage. 

b.  The  extent  of  knowledge  about  the  patient  at  admission:  When 
the  diagnosis  is  known  before  admission,  one  can  expect  less  for  diagnostic 
activity  and  diagnostic  cost  and,  since  the  admission  is  more  likely  to  be  for 
treatment,  we  can  expect  more  treatment  activity  and  treatment  cost.  Current 
systems  are  especially  insensitive  to  the  degree  of  diagnostic  uncertainty  in  a 
case. 

c.  Patterns  of  clinical  practice:  Well-trained  physicians  treat  simi¬ 
lar  patients  in  different  ways.  Some  operate  on  asymptomatic  gallstones,  others 
do  not;  some  hospitalize  for  unexplained  chest  pain,  others  do  not.  Thus,  there 
will  be  considerable  variation  in  the  treatment  defined  as  necessary  by  dif¬ 
ferent  experts . 

d.  Referral  Practices :  In  communities,  there  are  often  patterns  of 
referral  which  result  in  differential  assignment  of  the  most  difficult  or 
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treatment-resistant  cases  to  certain  hospitals.  A  dramatic  example  occurs  in 
many  communities  where  general  hospitals  skim  the  most  treatable  psychiatric 
patients  and  refer  or  commit  the  most  difficult  to  state  hospitals  or  similar 
institutions.  Often  the  decision-making  processes  are  extremely  intuitive  and 
difficult  to  quantify,  but  studies  of  the  communities  can  make  clear  that  such 
patterns  do  exist  and  are  significant. 

e.  Psychosocial  Characteristics  of  Patients:  Large  urban  hospitals 
have  argued  that  psychosocial  factors  (such  as  whether  patients  have  a  fixed 
address,  have  sufficient  money  to  alter  living  arrangements,  have  someone  at 
nome  to  care  for  them)  strongly  influence  length  of  stay  and  care  costs  for 
medical-surgical  patients.  Research  of  the  type  proposed  by  Arnold  Epstein  at 
Harvard/Bri gham  &  Women's  would  substantially  help  in  clarifying  these  issues. 

Although  this  variable  may  be  less  important  for  DoD's  active  duty  performance, 
it  probably  is  important  for  retirees. 

The  difficulties  of  pursuing  soft  variables  such  as  these  is  evident 
from  the  problems  which  have  beset  Horn's  efforts  to  move  from  her  initially 
promising  results  to  a  system  which  can  be  quantified,  summarized  adequately  in 
a  training  manual,  arid  reduced  to  abstractable  and  mathematically  summarizable 
elements.  Nevertheless,  we  have  every  reason  to  believe  that  the  variables 
listed  above  are  very  important,  and  we  would  have  to  be  extremely  suspicious 
that  a  system  which  purports  to  explain  a  very  large  portion  of  the  cost 
variation  without  taking  these  variables  into  account  may  be  measuring  actual 
treatment  rather  than  need  for  treatment. 

These  tasks  involve  developing  measures  for  new  variables  as  well 
as  collecting  data  to  measure  their  impact  and  their  importance  for  prospective 
payment.  Finally,  we  need  to  begin  t0  think  about  a  problem  raised  by  Garber's 
work:  how  do  we  reimburse  cases  when  some  physicians  would  believe  it  desirable 
to  keep  the  patient  alive  and  others  would  not  consider  that  goal  in  the 
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patient's  best  interest?  We  might  expect  social  forces  to  take  care  of  the 
problem,  but  recent  articles  in  the  New  England  Journal  confirm  a  widespread 
impression  that  physicians  are  providing  more  care  than  their  patients  want,  and 
economic  pressures  to  contain  that  trend  may  be  appropriate. 

5.  Capitation:  The  steps  of  refinement  discussed  above  are  important  to 
and  supportive  of  movement  toward  a  capitation  system,  because  a  competitive 
capitation  system  is  likely  to  need  methods  for  setting  capitation  rates  which 
are  more  precise  and  sensitive  than  the  methods  of  the  current  Medicare  Average 
Annual  Per  Capita  Charge  (AAPCC).  Each  of  the  steps  described  above  provides 
data  which  will  support  the  development  of  appropriate  patient-specific  capita¬ 
tion  rates.  If  HCFA  were  DoD  it  would  be  moving  very  quickly  in  this  direction. 

6.  How-to  Compare  Case  Mix  Systems:  With  this  in  mind,  I  would  like  to 
suggest  a  few  ideas  to  use  in  comparing  case  mix  systems  and  thinking  which  ones 
you  might  wish  to  adopt  and  which  to  engraft  onto  others. 

a.  Necessary  Data:  DRGs,  Patient  Management  Paths,  and  Staging  all 
run  on  the  Uniform  Hospital  Discharge  Data  Set.  This  is  a  major  advantage.  On 
the  other  hand,  this  data  set  has  very  little  clinical  texture.  APACHE  and 
MEOISGRPS  run  on  a  defined  clinical  data  set  which  could  be  collected  as  part  of 
discharge  abstracting  but  would  require  additional  effort  and  would,  in  con¬ 
sequence,  present  significant  problems  if  you  look  for  calibration  data  sets 
from  which  to  calculate  weights.  Severity,  which  is  the  most  impressionistic  or 
implicit  method,  requires  chart  reading. 

b.  Ability  to  Distinguish  Necessary  Care  from  Care  Rendered:  On  this 
score,  PMCs,  Staging,  APACHE,  and  MEDISGRPS  are  strongest.  However,  PMCs  and 
Staging  depend  on  great  detail  in  discharge  diagnoses,  and  this  may  well  be 
strongly  correlated  in  existing  data  sets  with  having  residents  and  interns  and 
hence  with  Leaching  status. 


DRGs  depend  heavily  on  whether  an  operating  room  procedure  was  per¬ 
formed  but  not  on  other  elements  of  care.  Because  the  weights  reflect  the  cost 
of  the  operating  room  procedures,  DRGs  are  actually  neutral  to  performance  of 
surgery,  which  may  be  an  appropriate  stance  in  the  absence  of  adequate  guide¬ 
lines.  DRGs  are,  however,  not  neutral  to  how  efficiently  the  surgery  was 
managed,  implicitly  rewarding  efficient  management. 

The  most  serious  criticism  of  Severity  of  Illness  is  that  it  depends 
so  heavily  on  the  '‘feel"  of  the  medical  record  that  it  may  actually  reflect  the 
care  rendered  rather  than  the  need  for  this  care.  Ultimately,  any  method  which 
did  this  would  return  to  cost-based  reimbursement,  or,  for  DoD's  purposes,  to 
saying  that  resources  used  equals  performance.  The  validity  of  this  criticism 
is  impossible  to  assess  at  this  time,  although  I  have  suggested  above  that  we 
must  have  concerns  about  any  system  which  explains  more  than  90%  of  variance 
without  including  certain  major  categories  of  information. 

c.  Objectivity:  Although  all  of  these  systems  except  Severity  of 
Illness  are  computerized  and  therefore  appear  objective,  there  is  always  con¬ 
siderable  leeway  in  the  nuance  of  diagnosis  and  in  whether  complications  (so 
important  to  DRGs  in  particular)  are  recorded.  Severity  has  a  high  rater 
reliability,  but  the  way  in  which  the  individual  scales  are  scored  is  still 
soft;  the  way  in  which  the  overall  score  is  determined  is,  in  some  ways,  a 
greater  concern  regarding  objectivity,  since  it  is  disturbing  that  the  combining 
of  the  scale  scores  has  evaded  summary  in  a  formula  for  several  years. 

d.  Clinical  Reasonableness:  Clearly,  the  chart-reading  methods  of  the 
Severity  index  are  the  most  clinically  reasonable  way  to  reach  a  conclusion. 

The  dependence  on  vital  signs  and  critical  findings  in  APACHE  and  MEDISGRPS  is 
highly  reasonable  for  conditions  where  emergency  admissions  and  critical  illness 
are  likely,  but  their  application  to  elective  surgery  and  non-critical  illness 


is  less  clear.  By  contrast,  many  clinicians  would  be  suspicious  of  the  adequacy 
of  systems  resting  exclusively  on  diagnosis  to  describe  the  variations  in 
necessary  care. 

d.  Neutral ity:  Various  systems  are  neutral  to  different  aspects  of 
medical  practice.  For  example,  DRGs  are  relatively  neutral  to  whether  surgery 
is  performed  while  staging  appears  to  impose  a  penalty  for  surgery.  DoD  will 
want,  I  think,  to  give  special  attention  to  whether  a  case  mix  system  is  as  sup¬ 
portive  of  full  restoration  to  duty  as  service  policy  may  dictate. 

E.  SUMMARY:  The  choice  of  a  case  mix  system  and  a  way  to  implement  it  pose 
very  complex  problems  for  DoD,  but  the  HCFA  experience  suggests  that  the 
problems  are  solvable  and  would  improve  DoD's  measurements  of  medical  activity 
performance. 


THE  IMPACT  OF  SEVERITY  OF  ILLNESS  ON  DRG  BASED  COST  ANALYSIS 

Susan  D.  Horn,  Ph.D.* 

It's  my  pleasure  to  be  here  this  afternoon  and  share  with  you  some  of  the 
work  we've  been  doing  on  quantifying  hospital  case  mix.  I'll  talk  mostly  from 
slides  because  I  think  it  will  help  to  bring  out  many  of  the  points  that  I'm 
trying  to  share  with  you  a  lot  more  clearly  than  if  I  just  tried  to  explain  it 
to  you  verbally.  We're  going  to  examine  the  connection  between  DRGs  and 
Severity  of  Illness.  Before  we  get  started  I  want  to  share  with  you  some  other 
case  mix  grouping  systems  just  to  go  over  their  definitions  so  that  you  can  see 
the  different  kinds  of  philosophies  that  people  have  used  to  try  to  quantify 
hospital  patients  and  what  the  products  of  the  hospital  are.  We'll  go  through 

the  definition  of  six  different  case  mix  groups,  but  our  concentration  will 

be  mostly  on  DRGs  and  Severity  of  Illnesses.  I  don't  think  that'll  be  much  of  a 

loss  in  terms  of  our  discussion  today,  because  research  has  been  able  to  show 
that  the  other  case  mix  grouping  systems,  the  ICD9  codes,  the  CPHA  PAS  List  A, 
Disease  Staging,  and,  although  we  are  not  quite  sure  about  Patient  Management 
Categories,  but  at  least  the  other  three,  research  has  shown  are  really  equiva¬ 
lent  to  the  467  DRGs  in  terms  of  their  ability  to  predict  resource  use. 

The  very  first  way  people  thought  about  grouping  hospital  patients  was  just 
using  the  ICD9  code  system  itself,  where  each  disease  is  given  a  specific  code. 
Aggregating  codes  at  the  three  digit  level,  we  end  up  with  1,000  codes  and  hence 
1,000  cells  to  group  patients  into.  At  the  four  digit  level,  we  have  10,000 
code  groups  and  now  with  the  new  ICD9-CM  system,  we  have  more  than  10,000  code 
groups.  That's  one  way  of  grouping  hospital  patients,  but  many  people  looked  at 
this  and  said,  "Well  that's  a  start,  but  there  can  be  much  more  wrong  with  a 

*  Edited  by  LTC  John  A.  Coventry  from  Dr.  Horn's  taped  presentation 
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patient  than  just  how  sick  they  are  in  what  level  they  have  been  classified  in 
terms  of  principle  diagnosis."  Many  people  felt  this  was  not  sufficient  to 
quantify  what  different  patients  were  in  different  hospitals.  CPHA  took  another 
approach  to  grouping  hospital  patients.  They  took  those  over  10,000  19  codes 

Figure  2 
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and  collapsed  them  down  to  398  principle  diagnosis  code  groups.  Then  to  answer 
some  of  the  criticism  of  the  other  systems  not  taking  into  account  secondary 
diagnosis,  etc.,  they  broke  each  one  of  those  code  groups  into  two  depending 
upon  presence  or  absence  of  an  operating  room  procedure.  Then  they  broke  them 


into  five  different  age  groups  and  two  more,  depending  upon  presence  or  absence 
of  a  secondary  diagnosis.  By  the  time  they  were  done  splitting,  they  ended  up 
with  almost  8,000  code  groups.  In  terms  of  numbers,  that  was  going  in  the 
right  direction.  Many  people  felt  10,000  was  too  many,  so  8,000  was  2,000  less 
than  that  and  we're  going  in  right  direction.  Other  people  looked  at  this  and 
said,  "You  have  just  asked  if  they  had  an  operation  or  not,  you  haven't  said 
what  kind  of  an  operation  it  is.  Also  you  just  ask  if  they  have  a  secondary 
diagnosis,  you  haven't  said  anything  about  the  type  of  secondary."  Maybe  we 
can  do  even  better  than  that  in  describing  differences  in  hospital  patients. 

Another  attempt  to  describe  hospital  patients  is  the  Disease  Staging 
System.  This  took  a  very  different  approach.  This  approach  was  trying  to  look 
at  the  seriousness  of  the  disease  and  in  that  sense  we're  getting  a  little 
closer  to  the  idea  of  severity.  They  quantified  seriousness  and  at  this  time 
I  believe  there  are  41?  diseases  that  have  been  staged.  Each  stage  is  supposed 
to  quantify  progressively  a  greater  physiological  extent  of  the  disease  in  the 
body.  (See  Figure  3)  The  idea  was  to  take  each  disease  condition  and  have  a  team 

Figure  3 
DISEASE  STAGING 

STAGE  0  -  NO  DISEASE  PRESENT 
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THE  FINAL  EVENT  OF  THE  ILLNESS 
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of  specialists  for  that  disease  condition  describe  the  patients  in  this  kind  of 
generic  grouping  system.  For  each  disease  condition  each  of  these  stages  are 
further  defined  in  terms  of  specific  conditions.  The  first  stage  is  no  disease 
present.  Stage  0.  Then  Stage  1  is  a  certain  diagnosis  but  no  local  or  systemic 
complications.  The  Stage  2  is  disease  processes  limited  to  an  organ  or  a 
system,  with  significantly  increased  risk  of  complications.  Stage  3  is  greater 
problems  than  Stage  2  and  finally  Stage  4  is  the  most  severe  or  death.  Although 
this  is  again  a  generic  way  of  describing  the  disease  condition  that  you  look  at 
in  terms  of  the  staging  criteria,  they  have  specific  descriptions  of  levels--how 
the  patient  would  look  as  a  Stage  1,  2,  3  or  4.  This  was  getting  closer  to  how 
sick  hospital  patients  were.  Many  people  felt  it  had  a  lot  of  positive  ramifi¬ 
cations.  People  looked  at  this  and  said  it  represented  a  progressively  greater 
physiological  extent  of  disease  in  the  body  but  they  also  said  it  only  included 
the  principal  diagnosis.  Patients  can  have,  in  addition,  other  kinds  of  disease 
conditions  that  also  need  to  be  treated  in  the  hospital,  so  although  we're 
getting  closer  to  how  sick  the  patient  is,  it  is  only  in  one  dimension.  We  need  a 
mechanism  of  looking  at  the  total  burden  of  illness  that  a  patient  presents;  not 
only  their  principal  diagnosis,  but  their  secondary  diagnoses  too.  In  addition, 
it  also  did  not  look  at  how  the  body  responded  to  it's  diseased  condition 
because  we  know  with  a  specific  disease  condition,  some  people  may  respond  very 
quickly,  and  other  people  may  respond  more  slowly.  So,  while  proceeding  in  the 
right  direction,  people  weren't  sure  it  was  the  complete  description  they  wanted 
for  hospital  patients. 

Another  different  approach  is  the  Patient  Management  Category  approach. 

(See  Figure  4)  This  one  we  haven't  seen  very  much  of  so  I  can  only  briefly 
describe  to  you  what  I  understand  it's  philosophy  is.  The  idea  behind  this 
system  is  to  look  at  final  diagnosis,  that's  the  discharge  diagnosis  or  the 
principal  diagnosis.  Also  look  at  the  reason  the  patient  was  admitted  to  the 


PATIENT  MANAGEMENT  CATEGORIES  (PMCs) 


PMCs  TAKE  INTO  ACCOUNT  BOTH  REASON  FOR  ADMISSION  AND  FINAL  DIAGNOSIS. 

FOR  EACH  PMC,  PHYSICIAN  PANELS  SPECIFIED  COMPONENTS  OF  CARE  (DIAGNOSTIC 
SERVICES,  TREATMENT  PROCEDURES,  AND  EXPECTED  LOS)  THAT  ARE  REQUIRED  FOR 
THE  TYPICAL  PATIENT.  THESE  COMPONENTS  OF  CARE  FORM  THE  BASIS  FOR  DERI¬ 
VATION  OF  RELATIVE  COST  WEIGHTS  FOR  EACH  PMC. 

hospital.  Then  for  each  one  of  those  combinations  ask  physicians  to  specify  the 
components  of  care  you  would  expect  in  the  care  of  that  patient,  both  diagnosti¬ 
cally,  and  therapeutically,  and  what  expected  length  of  stay  that  patient  would 
have.  So,  you  are  looking  at  a  typical  patient  coming  in  with  certain  symptoms 
and  having  a  final  diagnosis  and  then  asking  physicians,  what  would  you  typi¬ 
cally  do  to  them.  That's  why  they're  called  Patient  Management  Categories. 

To  date,  this  system  is  still  under  development  by  people  of  Blue  Cross  of 
Western  Pennsylvania.  We  don't  have  a  lot  of  descriptions  in  terms  of  what 
their  categories  look  like  except  that  we're  told  there  are  going  to  be  about 
750  groups.  In  terms  of  our  hierarchy  of  numbers  we  started  out  with  10,000 
19  codes  and  went  to  8,000  CPHA  groups.  Depending  upon  how  you  look  at  the 
Disease  Staging  system,  you  can  end  up  with  412  diseases  times  four  stages,  or 
about  1,600  groups.  Based  on  other  considerations,  I  understand  there  can  be  up 
to  3,300  groups  there.  Now  with  Patient  Management  Categories,  we  get  down  to 
750.  But  again  it's  a  very  different  philosophy.  It's  what  do  you  typically  do 
to  a  patient  that  describes  their  management  path. 

With  all  of  that  as  a  background  in  terms  of  different  ways  people  have 
looked  at  how  to  quantify  hospital  patients,  let's  look  at  what  the  DRG  people 
did.  Their  approach  was  very  different.  (See  Figure  5)  Rather  than  having 
people  speculate  on  what's  the  extent  of  the  disease  in  the  body,  or  speculate 
on  how  patients  are  treated,  or  automatically  put  patients  into  different 
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Figure  5 
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groups,  the  DRG  people  took  a  very  large  data  base  and  modeled  it.  They  took 
those  over  10,000  19  codes,  collapsed  them  down  to  23  organ  systems  or  Major 
Diagnostic  Categories  (MDCs);  then  took  about  1,500,000  cases  to  determine  how 
they  could  subdivide  those  cases  within  the  different  organ  systems  to  produce 
groups  that  had  similar  lengths  of  stay.  They  used  a  very  sophisticated  opera¬ 
tions  research  program,  an  optimization  program,  to  put  together  patients  that 
had  similar  lengths  of  stay.  They  took  into  account  three  aspects  of  the  pre¬ 
vious  systems  that  people  felt  may  have  limited  them.  They  took  into  account 
the  type  of  surgery  performed,  unlike  the  CPHA  system  which  automatically  split 
on  operation  versus  nonoperation.  Here  they  differentiated  on  what  type  of 
operation  it  was.  Secondly,  instead  of  just  noting  if  the  patient  had  a  secon¬ 
dary  diagnosis  or  not,  they  included  the  types  of  secondary.  They  made  a  spe¬ 
cial  list  of  the  complications  and/or  comorbidity  diagnoses,  called  the  C.C. 
list,  such  that  if  you  had  that  secondary  diagnosis  they  thought  that  you  were 
sicker  than  if  you  didn't  have  that  secondary.  Finally,  they  didn't  automati¬ 
cally  split  on  age.  They  only  split  on  age  when  they  felt  it  made  a  difference 
to  produce  groups  with  similar  lengths  of  stay.  The  philosophy  behind  the  DRG 


system,  I  think,  has  many  compelling  features.  The  philosophy  was  to  model  a 
data  set  to  see  what  the  data  was  telling  you;  to  use  very  high  powered  opera¬ 
tions  research  techniques  and  to  take  into  account  the  criticisms  of  the  other 
grouping  systems  by  looking  at  additional  factors  that  would  affect  the  severity 
of  the  patients. 

As  an  example  to  show  you  they  really  did  use  these  criteria,  MDC  1, 
the  Nervous  System,  divides  into  35  DRGs.  (See  Figure  6)  The  first  three  DRGs 

Figure  6 

MDC  1  -  NERVOUS  SYSTEM  (DRGs  1  to  35) 

DRG  1  -  CRANIOTOMY  EXCEPT  FOR  TRAUMA,  AGE  >=  18 
DRG  2  -  CRANIOTOMY  FOR  TRAUMA,  AGE  >=  18 
DRG  3  -  CRANIOTOMY,  AGE  <  18 

DRG  24  -  SEIZURE  &  HEADACHE,  AGE  >=  70  AND/OR  C.C. 

DRG  25  -  SEIZURE  &  HEADACHE,  AGE  18-69  W/0  C.C. 

DRG  26  -  SEIZURE  8,  HEADACHE,  AGE  0-17 

refer  to  a  specific  procedure.  Craniotomy.  If  the  patient  has  a  diagnosis 
that's  not  a  trauma  diagnosis  and  they're  over  17,  they  are  assigned  to  DRG  1. 

If  the  diagnosis  is  trauma  and  they're  over  17,  they  are  assigned  to  DRG  2.  If 
they're  under  18  they'll  go  into  DRG  3.  So,  you  see  how  the  type  of  procedures 
performed,  type  of  diagnosis,  trauma  or  not,  and  age,  are  all  used  in  describing 
the  patients  in  that  area.  If  they  don't  have  a  procedure,  as  an  example 
seizure  and  headache,  they  take  age  into  account  and  or  complications.  You  see 
the  age  breaks  turn  out  to  be  over  70,  18  to  69,  or  under  17.  Then  we  consider 
with  or  without  secondary  diagnoses,  referring  to  the  complication/comorbidity 
list.  They  really  did  use  their  definitions  and  stick  to  their  criteria  and  the 
fact  is,  by  using  this  high  powered  operations  research  technique  and  modeling 
their  data,  they  were  able  to  produce  467  DRGs  that  were  equal  in  their  ability 
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to  describe  hospital  resource  use  to  the  other  systems  that  had  many  more 
groups.  So  I  think  the  power  of  that  system  and  the  techniques  they  used  has 
real ly  come  through . 

In  contrast  to  all  of  these  systems,  let  me  explain  to  you  how  we  devised 
the  Severity  of  Illness  index.  This  index  actually  started  back  in  the  mid 
1970's.  Maryland  had  formed  a  cost  review  commission  in  the  early  1970 ' s . 

Unlike  the  state  of  New  Jersey,  where  the  commission  went  to  each  hospital 
and  said,  "All  our  patients  are  going  to  be  grouped  by  DRGs  and  then  we're  going 
to  pay  you  on  the  basis  of  DRGs,"  in  Maryland  the  head  of  the  commission  said 
to  the  hospitals,  "You  choose  the  case  mix  grouping  system  you  want,  but  once 
you've  chosen  it  you  have  to  stay  with  it  for  future  perspective  reimbursement." 
I  thought  that  was  kind  of  silly  because  I  realized  all  hospitals  would  be 
using  different  systems  in  the  state  of  Maryland  and  we  wouldn't  have  a  lot  of 
comparative  data.  But  now  that  I  look  at  it  many  years  later,  I  think  that  it 
was  probably  a  rather  brilliant  move  on  the  part  of  the  director  of  our  com¬ 
mission.  He  didn't  want  the  hospitals  coming  back  to  him  and  saying  "I  don't 
like  the  case  mix  grouping  system  you  use,"  and  talking  about  all  the  problems 
with  it.  Hopkins  Hospital  looked  at  the  case  mix  systems  that  existed  at  the 
time.  They  looked  at  the  ICD9  code  system  itself.  They  looked  at  the  CPHA 
system  that  existed  then.  They  looked  at  the  383  DRGs  that  existed  then.  The 
criteria  that  they  were  looking  for,  in  terms  of  a  good  grouping  system,  was 
homogeneity  of  resource  use--having  patients  in  a  group  that  required  similar 
resource  use.  What  they  found,  however,  was  whether  they  grouped  by  ICD9  codes 
or  CPHA  or  DRGs,  many  groups  had  patients  with  huge  variability  in  resource  use, 
some  requiring  $1,000  in  resource  use  with  other  patients  in  the  same  group 
requiring  over  $200,000  in  resource  use.  When  they  saw  this  kind  of  variabil¬ 
ity,  they  called  me  because  they  thought  it  was  a  statistical  problem.  I 
looked  at  their  data  and  I  saw  just  what  they  saw  --  huge  variability  of 
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resource  use  within  these  groups.  Very  often  the  standard  deviations  were 
larger  than  the  means  and  consequently  all  the  grouping  systems,  no  matter  which 
one  they  used,  had  problems.  They  ultimately  decided  to  investigate  the  DRG 
system  because  all  of  them,  as  I  mentioned,  had  problems  and  the  DRGs  had  the 
fewest  number  of  groups.  But  at  the  same  time  they  needed  to  figure  out  why 
it  wasn't  working.  So  we  took  some  of  the  DRGs  with  cases  that  had  low  resource 
use  in  a  DRG  and  high  resource  use  in  the  same  DRG.  We  asked  physicians  to  look 
at  these  cases.  The  physicians  came  back  and  said,  "It's  obvious.  The  patients 
with  $1,000  of  resource  use  run  into  no  complications,  have  no  other  diseases, 
respond  promptly  to  therapy  and  go  out  of  the  hospital.  On  the  other  hand, 
patients  with  $200,000  in  resource  use  suffered  complications  and  other 
diseases."  They  said  we  couldn't  think  of  them  in  the  same  group. 

That's  what  led  us  to  the  idea  of  trying  to  conceptual i ze  this  difference  of 
no  problems  versus  lots  of  problems.  We  gathered  a  team  of  physicians  and  nur¬ 
ses  together  and  said,  "Please  tell  us  all  the  dimensions  you  think  about  when 
you  go  to  a  colleague  and  say  I'm  treating  a  sick  patient.  What  is  it  that  you 
conjure  up  in  your  mind?"  Now  we  had  certain  goals  of  where  we  were  heading. 

We  had  the  goal  of  trying  to  produce  a  system  of  describing  how  sick  these 
patients  were.  That  was  simple.  That  was  something  of  a  four-category  system. 
The  physicians  wanted  a  five-category  system.  But  I  was  very  concerned  about  a 
five-category  system.  If  you  allowed  a  middle  value,  I  thought  we'd  find  85%  of 
our  cases  in  the  middle  and  we  wouldn't  have  distinguished  anything.  So  that's 
how  we  ended  up  with  the  four-category  system.  And  then  we  wanted  something 
that  could  be  used  across  all  disease  conditions.  We  wanted  a  generic  type 
system  and  we  wanted  something  that  was  simple  enough  so  that  we  wouldn't  have 
to  have  specialty  people  using  the  instrument.  Our  idea  was  that  if  we  could 
br-a*  tnis  idea  of  severity  down  into  sjbparts,  maybe  that  would  be  a  useful  w3y 
tj  bu’!d  a  system  that  would  be  reliable.  In  other  words,  people  with  very 
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little  medical  education  could  score  the  same  severity  as  somebody  with  a  lot  of 
medical  education.  So  what  we  ultimately  ended  up  with  is  what  you  see  in 
Figure  7;  seven  dimensions  tha..  are  used  as  a  guide  to  come  up  with  the  overall 
severity  of  illness.  Rather  than  just  asking  "How  sick  do  you  think  this 
patient  is,  1,  2,  3,  or  4?",  we  said,  "Please  answer  seven  questions  about  this 
person  first."  The  way  I  envision  it  is  like  a  CAT  scan  that  looks  at  a  patient 
from  a  variety  of  different  angles  and  then  tries  to  put  the  whole  picture 
together  in  terms  of  what  it's  seeing.  That's  what  we're  trying  to  do  in  our 
seven  dimensions  of  severity;  look  at  a  patient  in  seven  different  ways,  many  of 
which  are  related  to  each  other. 


Figure  7 

PATIENT  SEVERITY  INDEX 
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Let's  look  at  the  seven  dimensions  we  took  into  account.  The  first  was  how 
sick  were  they  in  the  principal  diagnosis.  Their  stage  of  principal  diagnosis. 
But  unlike  the  Disease  Staging  system  that  had  disease  specific  criteria,  since 
we  wanted  this  to  be  generic,  we  had  to  define  stage  in  terms  of  symptoms;  for 
example,  from  an  elective  surgery  patient  being  asymptomatic  up  to  patients 
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having  catastrophic  symptoms  such  as  coma  or  arrest.  Then  we  looked  at  the 
extent  of  compl ications,  from  none  or  very  minor  types  of  complications  that 
develop  after  the  patient  gets  into  the  hospital  that  really  don't  influence 
their  hospitalization  at  all,  up  to  something  catastrophic.  For  example,  a 
patient  could  be  admitted  in  respiratory  distress  but  then  go  into  pulmonary 
failure.  Interactions  looks  at  what  other  diseases  the  patient  has  and  to  what 
extent  they  get  out  of  line  while  the  patient  is  in  the  hospital.  They  can  have 
a  variety  of  other  diseases  but  if  they  stay  under  control  and  if  they  continue 
with  their  regular  maintenance  therapies,  that's  going  to  be  a  Level  1.  In  a 
patient  who  comes  in  for  diabetes  but  also  has  a  renal  problem  as  an  underlying 
condition,  that  for  us  would  be  an  interacting  condition.  If  they  went  into 
total  renal  shutdown,  that  would  be  a  catastrophic  interaction  level. 

Dependency  and  nonoperating  procedures  are  two  other  dimensions  that  we 
added  as  internal  monitoring  checks  in  the  instrument.  They  are  two  dimensions 
that  I  wasn't  sure  we  really  should  include  in  the  instrument  but  the  team  said 
when  you're  looking  at  an  instrument  and  you're  asking  people  to  answer 
questions,  you  should  have  some  questions  that  force  them  to  go  back  and  ask 
"Did  I  answer  the  parts  up  above  correctly?"  That's  what  I  mean  by  an  internal 
monitoring  check.  So  we  define  dependency  in  the  following  way.  Unlike  the 
usual  acuity  systems  that  define  dependency  to  be  what  types  of  tasks  you  do 
for  a  patient,  dependency  here  is  defined  to  be  the  usual  amount  of  care  for 
that  disease  condition  which  could  be  from  a  low  level  up  to  having  more  than 
the  usual  amount  of  care.  Now  the  only  way  someone  can  get  more  than  the  usual 
amount  of  care  for  a  disease  condition  is  if  something  has  gone  wrong  up  above. 

Either  they're  at  a  significantly  higher  stage  in  their  disease  or  some  compli¬ 
cations  develop  that  need  extra  care  or  some  diseases  are  out  of  line  that  need 
much  more  care.  That's  why  this  is  an  internal  monitoring  check.  If  by  reading 
through  the  nursing  notes,  you  think  that  the  patient  is  getting  more  than  the 
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usual  amount  of  care,  you  should  see  it  reflected  up  above.  On  the  other  hand, 
if  you  don't  see  it  reflected  up  above,  either  you  may  have  missed  something  up 
above  or  you've  misread  the  nursing  notes.  So  you  see  how  it's  an  internal 
check  within  the  instrument  itself  and  we  find  that  it  doesn't  drive  severity  at 
all.  When  we  do  our  regression  equations  to  see  the  effects  of  these  seven 
dimensions  on  predicting  severity,  it's  always  an  insignificant  variable  because 
of  the  way  it's  defined.  The  same  thing  was  true  with  the  variable  non¬ 
operating  procedures.  It  was  put  in  there  to  distinguish  patients  that  went 
onto  life  support  versus  non-life  support.  When  a  patient  went  onto  life  sup¬ 
port,  the  team  felt  that  it  really  meant  the  patient  was  very  sick.  Something 
really  had  to  be  wrong.  Life  support  are  things  like  renal  dialysis,  vent¬ 
ilator,  etc.  If  a  patient  went  onto  life  support,  again  you  should  see  it 
reflected  in  either  something  major  or  something  catastrophic  up  above.  Again, 
you  see  how  it's  an  internal  monitoring  check.  In  fact,  it's  a  variable  that 
trained  people  very  often  feel  is  one  of  the  variables  that's  going  in  the  wrong 
direction  in  our  severity  instrument.  By  that  I  mean  the  following:  Very  often 
there  are  patients  at  Level  3  or  4  overall  in  terms  of  how  sick  they  are,  that 
don't  have  any  life  support  procedures.  Hence,  no  matter  how  many  procedures 
are  done  to  them  in  terms  of  non-operating  room  procedures,  they  can't  get  above 
a  Level  2  on  this  dimension  unless  they've  gone  onto  life  support. 

Consequently,  most  of  the  time  when  we  see  this  variable  working,  it  doesn't 
influence  things  at  al '  .  Again  it's  to  check  to  see  if  they've  gone  onto  life 
support  has  it  been  picked  up  above.  Very  often  what  happens  with  that  variable 
is  it's  only  used  for  that  purpose  and  completely  ignored  when  the  overall  pat¬ 
tern  is  examined.  Most  of  the  time,  frankly,  we  find  these  Level  3  and  4 
patients  don't  have  that  kind  of  life  support  procedure. 


finally,  we  look  at  response  to  therapy.  How  quickly  a  patient  responds  is 


defined  in  the  following  way.  Prompt  means  the  best  response  you  can  expect-- 
one  course  of  therapy  and  the  patient  gets  better.  Moderate  delay  means  they  do 
respond,  the  slope  is  positive,  but  it's  much  slower  and  you  have  repeat  courses 
of  therapy  that  take  much  longer.  Serious  delay  is  also  easy  to  detect,  you 
have  setbacks.  The  patient  gets  better  and  gets  worse,  gets  better  and  gets 
worse.  Finally,  no  response  is  also  easy  to  detect.  The  last  dimension  is  a 
remission  of  their  acute  symptoms,  which  we  called  res i dual .  What  were  looking 
for  when  a  patient  is  hospitalized  in  terms  of  their  acute  care  is  remission  of 
their  acute  symptoms  that  brought  them  into  the  hospital.  For  example,  when 
someone  comes  in  with  emphysema  and  in  respiratory  distress,  we  know  we're  not 
going  to  cure  the  emphysema,  but  what  we're  trying  to  cure  is  their  respiratory 
distress . 

So  these  are  the  seven  dimensions,  the  seven  questions  asked  about  a 
patient.  Each  of  them  is  answered  and  then  a  pattern  appears.  What  typically 
happens  is  for  those  patients  who  don't  run  into  any  problems,  have  no  complica¬ 
tions  or  other  diseases,  and  respond  promptly  to  therapy,  they're  going  to  end 
up  a  Severity  Level  1.  Those  that  run  into  some  kind  of  problems,  (the  driving 
variables  turn  out  to  be  complications,  interactions  and  response  to  therapy) 
are  going  to  end  up  as  Level  2's.  Running  into  major  problems,  they're  going  to 
be  at  Level  3  severity.  Catastrophic  problems  are  going  to  be  Level  4.  That, 
in  general,  is  how  the  instrument  works. 

I  can  share  with  you  very  quickly  four  different  patients  that  end  up  at 
four  different  severity  levels  and  you  can  see  a  bit  more  clearly  how  these  cri¬ 
teria  are  used.  I  happen  to  remember  these  four  patients  because  we  were  using 
them  last  week  in  explaining  this  instrument  to  the  people  in  the  Maryland  Cost 
Review  Commission.  Hopkins  is  negotiating  with  the  Cost  Review  Commission  to  be 
reimbursed  on  the  basis  of  the  seventy  adjusted  DRG  system.  All  four  of  these 
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patients  were  Parkinson's  disease  patients  because  we  happened  to  be  working 
with  neurology  at  the  time.  The  first  patient  came  in  with  tremors  from 
Parkinson's  disease  and  was  considered  moderate  in  terms  of  stage  of  disease. 

He  had  no  comp! ications  so  he  was  at  Level  1  there.  The  patient  had  a  history 
of  peptic  ulcer  which  was  kept  under  control  with  diet  so  that  was  considered 
none  in  terms  of  interactions .  He  got  the  usual  amount  of  nursing  care.  They 
did  no  special  testing  for  the  patient.  They  titrated  the  patient's  medication, 
the  patient  promptly  responded,  the  tremors  went  away,  and  the  pattern  was 
essentially  a  Severity  Level  1.  The  second  patient  was  a  gentleman  who  also 
came  in  with  tremors  and  also  had  his  medications  titrated.  But  this  patient 
had  a  history  of  heart  problems  and  three  days  into  his  stay  developed  chest 
pains.  They  were  detected  quickly  and  it  was  realized  that  this  patient  was 
going  into  congestive  heart  failure.  Because  of  that  congestive  heart  failure, 
they  had  to  do  considerably  more  testing  for  the  patient.  They  also  had  to  get 
that  congestive  heart  failure  under  control  first  before  they  could  get  the  tre¬ 
mors  under  control,  which  signif icantly  delayed  his  response  in  terms  of  his 
tremors  getting  under  control.  He  had  no  other  complications  but  for  us  the 
congestive  heart  failure  was  considered  a  moderate  interaction  and  that  patient 
went  out  an  overall  Severity  Level  2.  Now  that  patient  also  brings  up  another 
feature  of  the  severity  index--the  severity  index  does  not  look  at  quality  of 
care.  At  the  time  this  patient  was  being  explained  to  the  Commission,  the  nurse 
who  was  explaining  said  she  has  been  to  other  places  across  the  country  where 
hospitals  have  been  collecting  severity  data.  In  some  of  those  institutions, 
they  might  not  have  caught  that  congestive  heart  failure  as  quickly  as  they  did 
at  Hopkins.  Consequently,  that  patient  could  have  deteriorated  into  much  more 
serious  congestive  heart  failure.  They  could  have  had  serious  set-backs  and 
could  have  required  much  more  than  the  usual  amount  of  nursing  care.  They  could 
have  even  had  to  have  some  type  of  life  support  procedure  and  that  patient  could 


have  gone  out  as  an  overall  Level  3.  So  quality  of  care  doesn't  come  into  this 
instrument.  We  just  ask  how  sick  is  that  patient,  we  don't  ask  how  they  got 
that  sick.  The  third  patient  also  came  in  with  tremors  but  this  was  a  sicker 
patient.  This  was  a  patient  who  had  a  history  of  renal  problems  and  went  into 
renal  failure  in  the  hospital.  That  was  considered  a  major  interaction  for  us. 
They  had  to  go  onto  dialysis  which  was  not  a  scheduled  dialysis.  As  a  result 
the  patient  had  a  very  up  and  down  course.  They'd  get  the  renal  failure  under 
control  and  the  tremors  would  go  out  of  line,  and  then  the  renal  failure  would 
go  out  of  line.  That  was  really  a  very  rocky  road  for  a  long  period  of  time. 
That  patient  went  out  as  an  overall  Level  3.  And  finally,  the  last  patient.  It 
was  really  debatable  that  they  called  it  a  Parkinson's  disease  case  in  the  first 
place  because  this  was  an  elderly  gentleman  who,  in  addition  to  his  Parkinson's, 
had  cancer  of  the  liver.  While  in  the  hospital,  he  had  a  stroke,  had  heart 
failure,  had  a  total  respiratory  failure  in  addition  to  the  heart  failure,  and 
ended  up  dying.  So  at  the  end  of  our  scale,  that  patient  was  really  considered 
catastrophic  and  an  overall  Severity  Level  4.  So  you  see  how  different  those 
four  patients  are  in  terms  of  the  straight  forward  one  not  running  into  any 
problems,  the  one  that  had  the  congestive  heart  failure  that  was  caught,  the  one 
that  had  renal  problems,  and  the  one  that  had  all  the  other  disease  conditions 
and  died.  That's  what  we're  trying  to  differentiate  with  severity  on  this  index 
These  seven  dimensions  about  the  patient  are  taken  into  account  to  come  up 
with  an  overall  Severity  Level  of  1,  2,  3,  or  4,  and  then  this  data  is  merged 
with  discharge  abstract  data  so  that  we  can  analyze  each  of  these  severity 
groups  divided  into  three  parts:  those  patients  with  no  operating  room  proce¬ 
dure;  those  with  a  moderate  procedure;  and  those  with  a  major  procedure.  (See 
Figure  8)  Operating  room  procedures  in  themselves  in  no  way  influence  the 
severity  level,  but  the  data  can  be  analyzed  separately  by  what  type  of  proce¬ 
dure  is  done.  In  that  sense,  when  people  are  first  learning  about  severity  it's 


a  little  bit  distressing  to  them.  I  think  that  possibly  our  labeling  of  this 
index  as  a  "Severity  of  Illness  Index"  may  be  a  misnomer.  Perhaps  we  should 
have  labeled  this  a  "Deviation  from  the  Minimum  Index,"  because,  for  example, 
you  can  have  a  Severity  Level  1  open  heart  surgery  patient.  The  reason  they're 
going  to  be  at  Severity  Level  1,  although  that's  a  very  serious  major  operating 
room  procedure,  is  because  they're  not  going  to  run  into  complications,  they're 
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not  going  to  have  other  diseases  getting  out  of  line,  and  they're  going  to 
respond  properly  to  therapy.  But  we're  not  going  to  compare  Severity  Level  1 
open  heart  patients  with  Severity  Level  1  OB  patients.  That's  how  this  instru¬ 
ment  has  been  able  to  be  used  in  all  those  different  areas.  It  really  is  a 
deviation  from  the  minimum,  the  way  we  use  it.  In  terms  of  moderate  versus 
major  procedures,  we  had  a  team  of  surgeons  at  Hopkins  look  through  the  19  pro¬ 
cedure  code  book.  They  ended  up  with  about  15%  of  the  procedures  they  thought 
were  major  operating  room  procedures  and  85%  were  put  in  the  moderate  category. 
A  major  procedure  was  a  surgery  that  took  a  long  time  to  recover  from  or  took  a 
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great  deal  of  skill  to  perform--things  like  open  heart  surgeries,  total  hip 
replacements,  organ  transplants,  etc.  All  the  other  operating  room  procedures 
we  classify  as  moderate. 

In  summary,  we’ve  been  looking  at  a  variety  of  different  ways  to  describe 
hospital  patients.  They're  really  based  on  two  different  data  sets,  the 
discharge  abstract  data  or  the  total  chart.  (See  Figure  9)  All  the  other  case 
mix  systems,  even  those  that  were  originally  developed  by  looking  at  the  total 
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chart,  have  been  implemented  using  the  discharge  abstract  data  set,  including 
the  DRGs.  We  felt,  when  we  were  conceptualizing  severity,  that  abstract  data 
was  just  not  rich  enough  to  describe  how  sick  patients  were.  When  you  think 
about  it  historically,  the  discharge  abstract  data  was  developed  back  in  the  mid 
1960's  when  people  were  asking  epidemiologic  questions  about  hospitals.  They 
were  asking  counting  questions,  such  as  how  many  of  this  disease  did  you  treaf, 
how  many  of  that  procedure  did  you  perform,  or  how  old  were  your  patients.  They 
weren't  asking  questions  about  how  sick  are  your  patients  or  what  resources  do 
they  require.  Consequently,  we  feel  the  labels  that  exist  in  the  current 
discharge  abstract  data  set--  principal  and  secondary  diagnoses,  procedures  and 


CHART  DATA 
SEVERITY  OF  ILLNESS 
STAGING 

PATIENT  MANAGEMENT 
CATEGORIES 


52 


r 


'  V  1  ^ 


« 


I 

r. 

f 


« 


age--are  just  not  sufficiently  rich  to  describe  how  sick  patients  are.  That's 
why  when  we  conceptualized  severity  we  based  it  on  the  total  chart.  But  that  is 
one  of  its  limitations  because  to  collect  severity  data  people  have  to  go  back 
to  the  total  medical  record.  I'll  share  with  you  how  they  do  it  now.  It's  very 
efficient  the  way  they  do  it,  but  still  it  is  a  limitation.  In  our  current 
research  we  are  developing  a  data  base  that  I  view  as  a  bridge  between  these  two, 
an  expanded  discharge  abstract  data  base.  (See  Figure  10)  It's  really  based  on 
two  steps.  First  we're  taking  the  current  19  code  book  and  adding  a  sixth 
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digit  to  each  of  the  five  digit  codes.  The  sixth  digit  for  each  disease  will 
tell  how  sick  the  patient  is  in  that  disease  category,  whether  they're  at 
Severity  Level  1,  which  is  the  least  or  most  minor  symptoms  for  that  disease  up 
to  a  Level  4,  the  most  severe  symptoms  for  that  disease.  Each  diagnosis  will 
have  a  sixth  digit  and  the  new  discharge  abstract  data  set  can  have  a  sixth 
digit  for  the  principal  diagnosis  and  six  digits  for  each  of  the  secondary 
diagnoses.  We'll  still  use  procedure,  age,  sex,  and  discharge  status,  and  add 
as  another  variable  into  this  data  set  a  rate  of  response  to  therapy.  With 
that  as  our  new  expanded  discharge  abstract  data  set,  a  computer  formula  just 


like  the  DRG  formula  can  be  used  to  weight  those  different  values  and  come  up 
with  an  overall  severity  for  the  patient.  Also,  with  that  same  data  set,  DRG 
people  can  redo  their  DRG  AUTOGRPing,  the  Disease  Staging  people  can  recode 
their  system,  the  Patient  Management  Category  people  can  recode  their  system, 
and  we'll  all  have  a  richer  data  base  to  work  with.  But  that's  about  another 
year  in  the  completion  process. 

The  data  I'm  going  to  share  with  you  today  has  been  collected  using  the 
manual  Severity  of  Illness  system,  the  one  I  just  shared  with  you.  It's  been 
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collected  by  medical  records  and  utilization  review  personnel  because  our  stud¬ 
ies  originally  showed  that  whether  a  physician  specialist  rated  the  case  or 
a  physician  generalist,  like  an  internist  or  general  surgeon,  or  a  nurse  spe¬ 
cialist  or  a  nurse  generalist,  or  a  medical  records  person,  they  all  would  come 
up  reliably  with  the  same  overall  level  of  severity  because  of  the  quide  of 
those  seven  dimensions.  So  now,  on  an  ongoing  basis,  it's  the  medical  records 
or  utilization  review  personnel  that  collect  this  data  along  witn  their  usual 
discharge  abstract  coding,  or  discharge  utilization  review  reporting.  Thev  fine 
that  it  takes  them  almost  no  extra  time  to  collect  that  data  when  they're  doing 


54 


it  in  the  process  of  looking  at  the  chart  for  another  purpose.  So  it's  a  very 
efficient  system  to  implement  in  a  hospital.  They're  trained  in  the  defini¬ 
tions  in  a  three  day  training  program  and  that's  followed  on  a  monthly  basis 
for  the  first  three  months  with  reliability  follow-on  training,  and  quarterly 
thereafter.  That's  how  we're  sure  everybody  collecting  severity  data  across  the 
country  is  rating  the  severity  at  the  same  level.  In  other  words  what  they’re 
calling  a  Level  1  in  Boston  is  what  they're  calling  a  Level  1  in  California 
and  all  across  the  country  in  all  of  the  30  or  so  hospitals  that  are  now 
collecting  severity  data  on  an  ongoing  basis.  To  be  sure  the  data  is  reliable, 
we  go  in  and  recode  blindly  samples  of  the  cases  that  have  been  coded  within  a 
hospital.  We're  getting  between  90%  and  100%  agreement  rates.  We  know  that 
those  levels  are  comparable  across  the  country.  So  that's  the  kind  of  data  that 
I'm  going  to  share  with  you  this  afternoon;  data  collected  from  about  30  dif¬ 
ferent  institutions  on  about  200,000  cases.  I'm  not  going  to  show  you  all 
200,000  but  I'm  going  to  show  you  some  examples  from  them. 

Before  we  look  at  some  example  data,  we've  got  one  other  piece  of  house¬ 
keeping  that  we  have  to  take  care  of  and  that  is  to  look  at  some  statistical 
definitions.  I  was  telling  you  earlier  that  Hopkins'  criteria  for  a  good 
grouping  system  is  homogeneity,  but  I  really  didn't  define  it  for  you. 
Homogeneity  means  similarity.  In  this  case  we're  looking  at  similarity  of  total 
charges  or  similarity  of  resource  use.  There  are  three  statistical  tests  to 
help  us  evaluate  homogeneity.  'See  Figure  12)  I'm  going  to  use  all  of  them  in 
my  discussion  and  you  should  see  what  the  differences  are  in  terms  of  the  defi¬ 
nitions.  The  three  different  types  of  homogeneity  statistics  are  Reduction  in 
Variance  (RIV),  Coefficient  of  Variation  (CV),  and  the  Analysis  of  Variance 
(ANOVA)  F  test.  Each  of  them  are  slightly  different  in  their  approach  to  asking 
how  good  is  a  group. 


Figure  12 
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The  first  measure  of  homogenity  is  Reduction  in  Variance.  Incidentally, 
tnis  was  the  criteria  the  Yale  people  used  to  produce  the  DRGs.  This  asks  the 
question,  how  variable  is  your  group  to  start  with,  and  then  if  you  divide  that 
group  into  different  subsets,  how  small  is  the  variability  within  the  subsets. 
Now,  when  you  break  a  large  group  into  subsets  you  don't  automatically  get  a 
better  group  because  if  you  don't  do  such  a  good  job  when  you  break  the  group 
up,  you  can  take  some  patients  from  one  end  of  the  scale  and  some  patients  from 
the  opposite  end  of  the  scale  and  put  them  into  a  new  group.  You  can  repeat 
this  process  using  data  from  opposite  ends  of  the  scale  and  put  them  into  a 
second  new  group  and  you  can  form  lots  of  new  groups  but  none  of  them  are  any 
better  than  what  you  started  with.  When  you  look  at  this  statistical ly,  it 
means  the  variability  you  started  with  is  large  (TSSQ)  and  the  variability  in 
all  your  subgroups  you  put  together  is  also  large  (TWGSSQ).  The  difference  in 
tne  numerator  is  0  and  the  RIV  is  very  small.  That's  when  you  don't  do  such  a 
good  job.  But  when  you  do  a  good  job  you  start  with  a  lot  of  variability  (TSSQ) 
but  each  of  your  final  groups  takes  patients  that  are  close  together  in  terms  of 
resource  use.  The  total  variation  within  groups  (TWGSSQ)  is  much  smaller  so  you 
take  a  large  number  minus  a  small  number  and  get  a  large  number  over  a  large 
number  which  results  in  RIV  of  almost  100%.  So  large  Reduction  in  Variance  is 
what  we  look  for  in  terms  of  a  good  grouping  system.  We've  taken  a  large  amount 
of  variability  and  reduced  it  down  to  a  small  amount  variability. 


A  second  way  to  evaluate  groupings  is  to  look  at  the  Coefficient  of 
Variation.  That  asks  for  each  individual  group  how  variable  is  the  group  com¬ 
pared  to  the  magnitude  of  what  you're  measuring.  Many  people  look  at  that  and 
wonder  why  we  have  to  compare  variation  to  what  we're  measuring.  Why  don't  we 
just  look  at  how  variable  is  the  group  using  the  standard  deviation?  If  you 
keep  in  mind  the  following  example  I  think  you’ll  see  why  you  have  to  divide  by 
something.  If  you  had  a  group  of  patients  whose  average  resource  use  was  $2,000 
and  they  vary  plus  or  minus  $400,  wouldn’t  that  appear  very  different  to  you 
from  a  group  of  patients  whose  average  resource  use  was  $100,000  and  who  also 
vary  plus  or  minus  $400?  You  see  how  variability  depends  very  much  on  what 
you're  measuring,  and  that's  just  what  CV  looks  at.  How  variable  is  the  group 
compared  to  the  average  of  what  you're  measuring?  Our  goal  now  is  to  have  a  low 
CV. 

The  last  measure  is  the  ANOVA  F  test.  This  test  suggests  that  if  you've  put 
together  a  lot  of  subgroups  that  have  the  same  mean  and  same  amount  of  variabil¬ 
ity  as  you  started  out  with  why  do  you  bother.  What  you  want  in  this  case  is  a 
large  F  value  which  means  that  some  groups  you've  produced  are  very  different 
from  the  other  groups  and  the  variability  within  the  groups  is  small. 

Those  are  three  different  measures  of  homogeneity,  somewhat  related,  but 
really  different  in  a  sense  of  determining  how  good  are  the  groups.  An  example 
will  show  how  these  can  be  used.  This  is  a  contrived  example  to  bring  the  point 
home  very  clearly.  Figure  13  shows  a  data  set  within  eight  patients.  We're 
looking  at  their  lengths  of  stay  and  they  range  from  2  days  up  to  18  days. 
Suppose  these  are  all  gallbladder  patients  and  we  see  a  lot  of  variability  in 
this  group.  The  standard  deviation  is  7.8  days.  That  looks  pretty  clear 
because  the  m~*an  plus  one  standard  deviation  is  10  plus  7.8  or  roughly  18  days, 
and  10  minus  7.8  is  roughly  2  days.  Now  we're  going  to  break  that  group  into 
two  parts.  Suppose  we  look  at  those  eight  patients  and  determine  who  has  blonde 


Figure  13 

EXAMPLE  OF  GROUPING  HOMOGENEITY 
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hair  and  who  doesn't  have  blonde  hair  (Grouping  Method  A).  All  the  blondes  go 


into  group  one  and  all  the  non-blondes  go  into  group  two  and  we  look  at  their 
length  of  stay.  The  blondes'  lengths  of  stay  are  2,  3,  16,  and  18  and  the  non¬ 
blondes  are  2,  4,  17  and  18.  You  can  see  we've  broken  them  into  two  groups  but 
we  haven't  done  such  a  great  job  because  our  criteria  hasn't  made  a  lot  of 
sense.  The  means  in  these  groups  are  not  very  different  from  each  other  and 
their  variation  is  even  larger  than  it  was  before.  Coefficients  of  Variation 
are  also  quite  large  and  we  look  at  our  RIV  statistic  and  see  that  we  explained 
less  than  one  tenth  of  a  per  cent  of  the  variability  in  resource  use.  Now  we 
can  take  those  same  patients  and  group  them  based  on  those  without  a  secondary 
diagnosis  and  those  with  a  secondary  diagnosis  (Grouping  Method  B).  Suppose 
those  without  a  secondary  had  lengths  of  stay  2,  2,  3  and  4,  and  those  with  a 

secondary  had  lengths  of  stay  16,  17,  18  and  18.  Now  you  see  the  difference. 

The  means  are  very  different  from  each  other,  the  variability  and  the  standard 
deviation  is  much  smaller  within  each  group.  The  CVs  are  also  much  smaller  and 

the  RIV  is  98.7%.  So  you  see  how  we  can  take  a  group  and  divide  it.  One  way 

gets  you  some  place  and  the  other  doesn't.  Sc  I  wanted  you  to  see  how  these 
tnree  criteria  are  really  useful  in  determining  how  well  groups  are  put 
together . 

Let's  move  on  to  some  real  data.  First  I'll  share  with  you  some  data  from  a 
few  different  DRGs  that  I've  selected  for  two  purposes.  One  is  to  show  a 
variety  of  different  organ  systems  represented,  and  the  second  is  to  show  a 
variety  of  different  types  of  bias  we  found  in  the  DRG  system.  In  Figure  14 
DRGs  represent  the  nervous  system,  pulmonary,  circulatory,  musculoskeletal, 
nutrition  metabolic,  and  female  reproductive.  We  see  a  spread  of  severity 
across  these  cases.  What  happened  is  that  this  is  a  data  set  from  a  university 
teaching  hospital.  They  sent  us  discharge  abstract  data  merged  with  severity 
and  charge  data  so  I  could  run  this  through  the  DRG  grouper.  These  are  all  19 


UNIVERSITY  TEACHING  HOSPITAL 
I9/DRG  by  Severity 


60 


DRGs,  the  new  467,  so  I  put  them  into  the  approprite  DRGs  and  cross-classified 
them  by  severity.  For  example,  for  all  these  patients  that  had  craniotomy, 
except  for  trauma,  over  18,  DRG  1,  some  of  them  looked  straight  forward,  not 
running  into  any  complications  or  other  diseases,  and  were  assigned  Severity 
Level  1.  Twenty-two  patients  ended  up  in  that  category.  Fifty-four  of  them  ran 
into  some  kinds  of  problems  and  were  coded  at  the  Level  2  severity.  Twenty-one 
ran  into  more  major  problems  resulting  in  Level  3  severity.  Finally,  11  were 
catastrophic.  All  the  deaths  go  into  group  4,  but  there  can  be  live  4 1 s .  Those 
cases  where  they  try  to  save  patients  with  major  operations,  really  have  very 
great  financial  resource  impact  on  the  facility. 

Now  that  we've  seen  that  kind  of  spread  of  severity,  let  me  show  you  some 
of  the  financial  ramifications.  All  of  these  examples  have  been  taken  from 
different  hospitals  but  each  illustration  represents  only  a  single  hospital's 
data.  So  within  each  example  we  have  no  variability  on  different  costs,  the 
charge  ratios,  wage  rates,  etc.  Figure  15  is  an  example  of  DRG  75.  Forty-seven 

Figure  15 
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patients  in  this  DRG  averaged  $11,700  to  treat.  Patient  charges  ranged  from 
$1,000  to  more  than  $205,000.  This  is  one  of  the  DRGs  that  originally  stimu¬ 
lated  our  development  of  the  severity  index.  You'll  notice  the  Coefficient  of 
Variation,  is  2.51  or  251%,  meaning  the  standard  deviation  is  two  and  a  half 
times  the  size  of  the  mean.  Those  patients,  it  turns  out,  fell  into  four 
severity  levels  but  also  two  procedure  types.  Interestingly  enough,  even  though 
tne  ORGs  have  categorized  all  these  patients  as  major  chest  procedures,  when 
our  surgeons  loo^d  at  the  different  19  codes  some  of  them  were  categorized  as 
only  moderate.  Among  those  patients  with  a  moderate  procedure  we  still  saw 
three  levels  of  severity;  six  of  the  patients  were  Level  1  severity  averaging 
$2,600  to  treat,  11  of  them  ran  into  some  problems  designated  as  Severity  Level 
2,  averaging  $6,300  to  treat,  and  three  of  them  ran  into  major  problems  such  as 
some  kind  of  failure  as  we  were  discussed  before,  averaging  almost  $15,000  to 
treat.  So  even  though  they're  within  some  subset  of  a  DRG  having  what  we  con¬ 
sider  to  be  moderate  operating  room  procedures,  we  still  see  dramatic  differen¬ 
ces  in  resource  use.  Even  among  those  with  major  operating  room  procedures  we 
still  see  a  spread  of  severity.  Thirteen  of  them  were  Level  1.  That’s  again 
sometimes  puzzling  when  it's  a  major  procedure  but  they  can  be  Level  l's  because 
they  don't  run  into  complications,  don't  have  other  diseases,  and  respond 
promptly  to  therapy.  But  notice  the  difference  in  resource  use.  It's  almost 
$5,900  to  treat  a  Severity  Level  1  in  this  DRG  with  a  major  procedure  compared 
to  $2,600  to  treat  a  moderate  operating  room  procedure  Severity  Level  1  patient. 
Thirteen  of  them  were  Level  2  averaging  $10,500.  We  had  no  Level  3 ' s  but  had 
one  catastrophic  Level  4  patient.  I  think  all  of  us  could  consider  that  patient 
as  an  outlier  and  could  choose  to  ignore  that  in  this  data  set.  But  even 
ignoring  that  patient,  what  you  see  is  really  two  forms  of  bias  in  this  DRG. 

One  is  the  bias  of  type  of  procedure,  moderate  versus  major,  and  second  is  the 
bias  of  severity  within  that  procedure.  Notice  also  the  CVs  are  much  smaller 
here  than  what  we  started  out  with. 
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The  next  example,  shown  in  Figure  16,  goes  in  what  I  consider  to  be  the 
opposite  direction.  This  is  DRG  354,  Nonradical  Hysterectomy  either  among  the 
elderly  or  patients  with  a  comp] ication.  Looking  at  this  DRG  we  would  think  all 

Figure  16 

DRG  354  -  NON-RADICAL  HYSTERECTOMY 
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of  these  patients  would  have  moderate  operating  room  procedures.  Clearly  a 
nonradical  hysterectomy  is  a  moderate  operating  room  procedure.  So  it  was 
strange  when  we  found  five  patients  here  with  major  operating  room  procedures 
resulting  from  definitions  in  the  DRG  grouper.  It  looks  at  the  principal 
diagnosis  and  all  listed  procedures.  If  any  procedure  agrees  with  the  principal 
the  patient  goes  into  that  group  as  you  see  most  of  them  did.  But  it  turned 
out  for  these  five  women,  one  of  them  had  an  operation  on  the  pericardium,  two 
of  them  had  total  splenectomies,  and  two  had  total  colostomies,  which  we  con¬ 
sider  to  be  major  operating  room  procedures.  But  even  among  those  that  had  only 
a  moderate  operating  room  procedure,  you  again  see  dramatic  differences  in 
resource  use  by  severity  level.  Fifteen  of  them  were  Level  1  severities,  not 
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running  into  any  problems,  averaging  about  $5,300  to  treat;  seventeen  of  them 
were  Level  2  severities  averaging  $7,500  to  treat;  and  five  were  Level  3 ' s 
averaging  $15,000  to  treat.  Among  those  with  major  operating  room  procedures 
our  numbers  are  small  but  again  you  can  see  differences  by  severity  level.  You 
can  see  the  difference  that  it  makes  to  have  a  moderate  versus  a  major  operating 
room  procedure.  Overall  we  find  a  wide  spread  of  resource  use,  but  not  quite  as 
bad  as  we  saw  before.  Now  it's  $2,900  to  $74,000.  Still  we  see  a  high  CV, 
greater  than  one,  and  much  lower  CVs  for  these  severity  adjusted  groups  except 
for  one  Level  4  patient  who  had  a  major  operating  room  procedure  and  then  died. 
When  these  patients  die  early  we  do  have  a  drop  in  resource  use  with  Level  4 
patients,  causing  a  greater  spread. 

In  figure  17,  DRG  108  is  one  where  we  have  no  different  classification 
of  procedures.  Here  we  agreed  that  all  these  cardiothoracic  procedures,  except 


Figure  17 

ORG  108  -  CARDIOTHORACIC  PROCEDURES 
EXCEPT  VALVE  AND  CORONARY  BYPASS,  WITH  PUMP 
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valve  and  coronary  bypass  with  a  pump,  are  major  operating  room  procedures.  But 
notice  again  we've  got  a  huge  spread  of  resource  use  from  $5,000  up  to  over 
$289,000.  A  third  of  these  patients  were  straight  forward,  not  running  into 
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complications  or  other  diseases,  responding  promptly  to  therapy  even  though 
they'd  had  this  major  operation,  and  averaged  around  $10,000  to  treat.  Twenty- 
six  of  them  were  Level  2  severities  running  into  some  kinds  of  problems  aver¬ 
aging  $19,000  to  treat.  Five  of  them  are  Level  3  severities  running  into  more 
major  problems  averaging  almost  $28,000  to  treat,  and  eight  of  them  were 
catastrophic  averaging  $121,000  to  treat.  Of  those  eight  patients  five  lived 
and  they  averaged  $169,000  to  treat.  So  those  catastrophic  level  patients  have 
great  financial  implications.  I  think  you  can  see  why  those  institutions 
treating  proportionally  more  of  the  Level  2,  3  and  4  severity  cases  end  up 
having  an  average  for  this  DRG  that’s  $30,000.  Whereas,  most  other  institutions 
that  don't  see  that  kind  of  spread  of  severity  within  this  DRG  may  have  an 

average  of  about  $12,000  to  $15,000,  which  is  what  we  have  found  for  most  of  the 

hospitals  that  we're  working  with.  In  fact,  there's  one  hospital  collecting 
severity  data  now  that  is  known  nationally  as  being  a  major  teaching  hospital, 
but  it  also  has  a  very  large  heart  program.  We  first  looked  at  their  data  by 
DRGs  in  the  heart  area.  We  noticed  for  every  DRG  they  had  one  of  the  highest 

average  cost  in  heart  DRGs;  one  of  the  highest  averages  of  any  of  our  data  sets 

across  the  country.  In  fact,  when  they  were  in  negotiations  with  a  third  party 
payer  in  their  state,  the  third  party  payer  said,  "We  can't  send  any  of  our 
patients  to  you  because  your  costs  are  entirely  too  high,  particularly  in  the 
areas  where  you  have  your  specialties,  heart,  pediatrics,  etc."  We  were  able 
to  take  their  data,  score  it  for  severity,  and  were  able  to  show  that  by 
severity  level  in  each  of  these  DRGs  they  had  among  the  lowest  charges  of  any 
of  the  hospitals  in  our  data  base.  In  other  words,  although  their  averages  were 
very  high  it  was  being  driven  by  the  fact  that  they  were  seeing  proportionally 
more  of  the  severely  ill  patients.  But  they  were  in  fact  less  expensive  to 
treat  Severity  Level  l's,  and  they  were  less  expensive  to  treat  Severity  Level 
2's  because  of  the  expertise  that  they  had  developed  there.  I  was  surprised  to 


see  that  because  of  the  name  of  this  place.  I  figured  no  matter  who  walks  in 
the  door,  no  matter  how  sick  they  are,  they're  going  to  give  them  everything 
they  have  the  ability  to  do.  We  found  that  wasn't  the  case  at  all.  In  fact 
they  were  among  the  most  efficient  hospitals  we  have  in  our  data  base  in 
treating  patients  by  severity  level.  So  that's  why  being  able  to  quantify  that 
kind  of  effect  can  be  so  important. 

DRG  296  shown  in  Figure  18  is  another  example  showing  spread  of  severity  in 
a  DRG  with  no  procedures.  You  don't  have  to  have  operating  room  procedures  to 


Figure  18 

DRG  296  -  NUTRITIONAL  &  MI  SC.  METABOLIC 
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have  this  kind  of  problem.  This  is  Nutritional  and  Miscellaneous  Metabolic 
Diseases.  The  range  is  a  lot  smaller  but  again  we  see  the  spread  of  resources 
used  by  severity  level. 

Finally,  Figure  19  shows  an  example  of  a  good  DRG.  There  are  many  of  them 
so  I  couldn't  just  show  you  DRGs  with  a  lot  of  spread.  This  is  DRG  42  which  has 
quite  a  low  C  V  -  - i n  fact,  the  lowest  of  any  we've  seen  so  far.  This  is 
Intraocular  Procedures  except  Retina,  Iris  and  Lens.  These  patients  fell  into 
three  types  of  operating  room  procedures  and  mostly  two  severity  levels.  You 
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Figure  19 

DRG  42  -  INTRAOCULAR  PROCEDURES 
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see  no  Severity  Level  4 ' s  here  and  only  one  Severity  Level  3  patient,  but  even 
among  those  patients  that  are  Severity  Level  1  versus  2  you  see  about  a  $1,300 
difference  in  resource  use.  For  those  hospitals  that  are  treating  propor¬ 
tionally  more  of  their  patients  at  one  end  of  the  scale,  even  though  they're 
only  two  levels  apart  and  it's  a  small  difference,  by  the  time  you  multiply 
those  small  differences  by  large  numbers  of  cases  there  can  be  a  great  financial 

impact . 

In  summary,  this  is  some  data  to  show  you  why  we  are  concerned  about  DRGs 
as  a  mechanism  for  describing  hospital  patients.  Patients  in  a  DRG  even  within 
a  given  hospital  can  vary  greatly  in  charges  because  DRGs  don  t  take  severity 
of  illness  into  account  directly.  Those  hospitals  with  more  than  a  typical  pro¬ 
portion  of  the  most  severe  cases  are  exposed  to  great  financial  risk  as  a 


result . 
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The  next  three  examples,  shown  in  Figure  20,  are  from  a  talk  I  am  giving  on 
Friday  at  the  National  Cancer  Institute  (NCI)  about  what  the  cancer  DRGs  are 
going  to  look  like.  Some  people  from  NCI  visited  me  and  noticed  some  of  these 
character i sti cs  in  our  data  set  and  asked  me  to  prepare  this  talk  for  them. 

This  is  DRG  405  Leukemia  or  Lymphoma,  Age  0-17.  We've  got  106  patients  in  this 
DRG  in  a  hospital  that  happens  to  be  a  cancer  treatment  center.  These  children 
range  from  $700  to  $233,000  for  their  care,  so  again  it's  one  with  a  very  large 
resource  use  spread.  Notice  they  break  down  into  no  operating  room  procedure, 
moderate,  and  major.  Notice  by  severity  level  there  are  dramatic  differences  in 
resource  use  under  no  operating  room  procedure,  under  moderate  and  under  major. 
We  see  the  same  kinds  of  trends  only  now  under  major  procedure  the  costs  are 
much  higher  than  we  saw  before.  That's  because  of  the  bone  marrow  transplants 
that  they're  doing  on  these  patients.  Now  look  at  that  same  DRG  in  another 
teaching  hospital  not  having  a  specialty  cancer  program.  They  only  have  the 
patients  without  an  operating  room  procedure  and  you  notice  the  cost  is  $3,000 
for  Level  1  and  $10,000  for  Level  2  which  is  almost  identical  to  the  cost  of 
Levels  1  and  2  in  the  previous  hospital.  Yet  the  average  for  this  hospital  is 
$7,500  whereas  the  average  for  the  other  hospital  was  $33,000  because  they're 
seeing  all  the  more  severe  patients.  Next,  we  have  data  from  a  non-teaching 
hospital.  This  is  a  community  hospital,  that  had  11  patients  in  this  DRG.  Now 
the  range  is  from  $1,500  to  $20,000  and  again  you  see  by  severity  level  almost 
the  same  resource  use  for  Severity  Levels  1  and  2  as  we  saw  before.  This  data 
has  not  been  adjusted  for  cost-to-charge  ratios  but  what  we  are  finding  in 
looking  at  data  across  hospitals  is  much  greater  similarity  by  severity  level 
across  hospitals  and  much  greater  differences  between  severity  levels  within  a 
hospital.  Those  hospitals  that  have  very  .  DRG  averages  are  most  of  the  time 
that  way  because  of  the  pi  portion  of  more  severely  ill  patients  that  they're 
t^eat i ng . 
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Figure  20 

DRG  405  -  LYMPHOMA  OR  HUKEMIA  AGE  0-17 
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When  the  DRG  people  were  developing  the  DRGs  they  were  working  with  a  limi¬ 
tation  of  wanting  to  keep  the  number  of  groups  under  something  like  500.  Some 
people  thought  that  allowing  each  of  these  DRGs  to  be  split  up  into  a  number  of 
different  groups  might  be  the  total  reason  why  we're  getting  this  extra  explana¬ 
tory  power.  But  the  Disease  Staging  system  has  1,600  to  3,300  groups  and  they 
haven't  done  any  better  in  explanatory  power  than  DRGs  and  the  CPHA  system  has 
8,000  groups  and  they  haven't  done  any  better  than  the  DRGs,  so  we  really 
didn't  think  it  was  a  numbers  problem.  We  thought  it  was  more  what  we  were 
conceptualizing  about  the  patient.  So  we  went  back  and  analyzed  our  data  by 
looking  at  severity  and  procedure  type  within  the  organ  systems.  Severity  is 
attached  to  a  case  and  one  of  the  advantages  of  this  system  we  have  now  found 
but  never  intended  in  the  beginning  is  that  we  don't  start  with  any  hierarchy. 
Severity  is  attached  to  a  case  depending  upon  complications,  other  diseases,  and 
response  to  therapy  so  we  can  use  severity  within  any  other  grouping  strategy. 

We  could  put  it  within  DRGs,  we  could  put  it  within  Major  Diagnostic  Categories 
(MDGs),  we  could  put  it  within  Disease  Staging  groups,  or  within  CPHA's  groups. 
Some  hospitals  have  even  looked  at  it  within  services  in  their  hospital. 

Severity  of  Illness  doesn't  run  into  the  same  kinds  of  problems  that  all  the 
other  patient  classif ication  systems  do,  once  they've  started  from  a 
very  different  hierarchy.  So,  for  example,  to  try  to  put  Disease  Staging  within 
DRGs  would  be  a  problem.  I  was  reading  an  article  today  about  that.  One  DRG 
had  over  200  Disease  Staging  groups  within  the  DRG  because  they  start  with  a 
different  hierarchy.  The  beginning  hierarchy  for  DRGs  is  the  23  organ  systems, 
the  MOCs;  the  hierarchy  for  Disease  Staging  is  41?  diseases;  the  hierarchy  for 
CPHA  is  398  principal  diagnosis  cede  groups.  They  start  at  different 
hierarchies  and  they  can't  fit  together,  but  severity  can  fit  within  any  of  them 
because  we  don't  have  any  hierarchy  under  which  it's  done.  Let  me  show  you  thre*3 
different  MDCs.  One  is  the  most  homogeneous,  MDC  2,  the  Eve.  Figure  21  shows 
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619  eye  cases  from  a  hospital  that  has  an  eye  clinic  and  an  eye  specialty  ser¬ 
vice.  We  grouped  them  by  their  severity  levels  and  type  of  procedure.  Notice 
we  have  no  Severity  Level  4 1 s  here  and  only  three  patients  at  Severity  Level  3 
so  they’re  mostly  Severity  Levels  1  and  2.  But  there  is  a  difference  in 
resource  use  by  Severity  Level  1  versus  2.  Using  only  these  seven  groups  of 
patients  we  were  able  to  explain  almost  79%  of  the  variability  in  resource  use, 
with  a  very  high  F  test  indicating  group  averages  are  very  different  from  each 
other,  and  a  weighted  C V  of  48%.  We  took  those  same  patients  and  put  them  into 
their  corresponding  ORGs,  including  allowing  for  category  468.  The  data  fell 
into  14  DRGs  but  only  explained  13%  of  the  variability  in  resource  use  with  a 
much  lower  F  value  and  a  higher  CV .  So  even  among  the  DRGs  that  are  good, 
you're  going  to  see  much  worse  CVs. 

As  a  second  example,  take  an  intermediate  MDC  involving  the  heart.  Figure 
22  shows  a  data  set  from  one  hospital  with  about  800  patients  categorized  into 
MDC  5,  Circulatory  System.  Now  we  see  all  four  levels  of  severity  and  all  three 
procedure  types.  Notice  that  as  severity  increases  we  get  increasing  resource 
use  except  for  Severity  Level  4's  without  an  operation.  That's  because  those 
patients  either  die  before  they  can  do  anything  for  them,  or  they're  considered 
so  hopelessly  ill  that  they  don't  put  a  lot  of  resources  into  them.  But  when 
they  try  to  save  those  patients,  the  example  shows,  there  are  dramatic  increases 
in  resource  use  for  the  Level  4's.  With  these  12  groups  va  were  able  to  explain 
4d%  of  the  variability  in  resource  use  with  a  good  high  F  and  CVs  in  the  60's. 

We  took  those  same  patients  and  put  them  into  41  different  DRGs.  Notice,  ever’ 
though  this  results  in  more  than  three  times  the  number  of  groups,  we  explain 
only  25%  of  the  variauility  in  resource  use  with  a  much  lower  F  and  a  higher  CV. 
B.t  even  though  45%  is  higher  than  25%  it's  really  not  that  exciting.  Tnat's 


because  deaths  are  included  in  the  data.  So  in  Figure  23  I've  taken  that  same 
data  and  eliminated  the  deaths.  Now  with  these  11  groups  we're  explaining  71% 
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of  the  variability  in  resource  use  compared  to  the  DRGs  explaining  28 %  of  the 
variability  in  resource  use.  Notice  we've  jumped  from  45%  to  71%  and  the  DRGs 
have  gone  from  25%  to  28%.  I  believe  that's  one  of  the  reasons  why  in  the 
Medicare  prospective  payment  system  deaths  have  been  left  in  the  DRGs,  because 
we  have  found,  as  well  as  most  other  researchers,  that  deaths  in  or  deaths  out 
of  the  DRGs  don't  seem  to  make  much  of  a  difference  in  homogeneity.  But  in  our 
groups  it  does  because  of  who  ends  up  in  group  4.  Group  4  for  us  includes 
catastrophic  patients  of  different  types;  those  that  die  early,  those  that  they 
decide  not  to  do  anything  for,  or  those  that  they  try  to  save. 
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Figure  24  is  an  example  of  the  first  eight  MDCs  in  a  data  set  from  one 
hospital  comparing  variability  explained  by  severity  to  the  amount  of  variabil¬ 
ity  explained  by  DRGs.  Consistently,  across  all  the  organ  systems  with  death 
included,  severity  adjusted  groups,  even  though  they  are  fewer  in  number, 
explain  more  of  the  variability,  just  as  in  those  previous  examples.  The  same 
is  true  with  other  charges,  not  only  total  charges.  We  find  the  same  is  true 
with  lab  charges,  with  length  of  stay  and  with  routine  charges.  One  place  where 
it's  not  quite  as  dramatic  is  in  the  area  of  radiology  charges.  We've  con¬ 
sistently  found  that  severity  adjusted  groups  do  better,  but  only  slightly 
better,  than  the  DRGs.  A  number  of  radiologists  have  told  me  they  think  that 
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may  be  the  case  because  radiology  is  more  of  a  diagnostic  tool  than  a  thera¬ 
peutic  tool  so  they  determine  diagnostically  what's  wrong  with  the  patient  but 
then  the  therapeutic  aspects  of  the  patient  are  what  are  being  driven  by  the 
complications  and  the  other  diseases  in  response  to  therapy. 

Across  an  entire  hospital,  when  we  put  severity  within  MDCs  we  end  up  with 
about  200-250  groups,  compared  to  putting  the  patients  into  their  corresponding 
DRGs,  which  results  in  400-450  groups.  Figure  25  shows  an  example  of  data  sets 
from  three  university  teaching,  and  two  community  teaching  hospitals.  We  find 
severity  explaining  between  60%  and  85%  of  the  variability  in  resource  use. 

ORGs  are  explaining  between  30%  and  50%.  In  the  example  shown  in  Figure  26,  we 
have  four  university  teaching  hospitals,  one  community  teaching  and  one  com¬ 
munity  non-teaching  hospital  grouped  in  four  different  ways.  First,  we  group 
the  patients  by  DRGs  just  as  you  saw  before.  Then  we  group  the  patients  by  DRGs 
and  the  physician  who  treated  the  patient  because  I've  often  heard  some  of  the 
DRG  researchers  say  that  all  the  remaining  variability  in  DRGs  that's  not 
explained  by  the  DRGs  is  due  to  which  physician  was  treating  the  patient.  So  we 
wanted  to  investigate  to  what  eAtent  physicians  influence  that.  Then  we  looked 
at  DRGs  and  divided  them  by  severity  similar  to  the  previous  examples.  Finally, 
we  put  all  three  together:  DRGs,  severity,  and  physician.  DRGs,  as  the  pre¬ 
vious  examples  showed,  explained  between  30%  and  50%  of  the  variability  in 
resource  use.  DRGs  and  physicians  explain  about  50%  to  80%  of  the  variability. 
DRGs  and  severity  explain  70%  to  90%  and  all  three  together  explain  90%  to  96%. 

I  find  that  exciting  for  the  following  reasons.  No  matter  which  hospital  we're 
in,  DRGs  and  severity  don't  explain  everything.  There's  still  some  portion  left 
that's  due  to  physicians  in  each  of  these  institutions.  It  varies  by  size,  but 
there's  still  something  left  due  to  physicians.  It  appears  that  there  are  at 
least  two  factors  to  which  we  can  attribute  that  extra  amount  of  explanatory 
power.  One  is  that  DRG  and  severity  groups  are  nice  and  tight.  They  are  good 
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descriptors  of  the  patients,  but  different  physicians  treat  those  patients  dif¬ 
ferently.  Consequently  this  amount  of  explanatory  power  is  due  to  differences 
in  the  efficiency  of  physicians.  A  second  possibility  is  that  DRGs  anu 
severity  groups  are  not  that  tight  and  the  different  physician  specialists  may 
have  different  kinds  of  patients  within  the  same  DRG  and  severity  group.  In 
other  words,  this  may  be  a  classification  problem.  We're  now  looking  at  a 
number  of  hospitals  that  are  collecting  this  data  to  determine  what  differences 
are.  I'm  sure  we're  going  to  find  some  of  both.  But  the  exciting  thing  about 
that  is  that  I  think  it's  removable.  If  it's  due  to  a  classification  problem  we 
can  redefine  the  DRGs  to  take  it  into  account.  If  it's  due  to  a  physician  prac¬ 
tice  problem  we  can  work  with  physicians  in  changing  their  practice  pattern. 

Then  a  DRG  type  grouping  system  together  with  the  severity  measure  will  be  able 
to  explain  90%  to  96%  of  the  variability  in  resource  use  within  an  institution. 
That,  I  think,  is  sufficient  for  prospective  reimbursement  purposes.  I  think, 
however,  with  DRGs  currently  only  explaining  30%  to  40%  of  the  variability  in 
resource  use,  with  60%  to  70%  unexplained,  that  may  be  too  much  unexplained 
variance  to  expect  hospitals  to  be  reimbursed  on  a  prospective  basis--especial ly 
when  hospitals  are  expected  to  run  on  a  one  to  three  percent  margin  without  a 
lot  of  room  for  error. 

To  show  how  much  variability  within  DRGs  is  explained,  Figure  27  is  an 
example  of  how  we've  tried  to  put  this  data  together.  It's  really  difficult  for 
me  to  picture  what  this  is  doing  and  try  to  share  with  you  what  I  see.  This  is 
data  from  three  university  teaching  hospitals,  two  community  teaching,  and  one 
community  non-teaching  hospital.  The  patterns  show  various  ranges  of  percent 
RIV  distributed  across  each  hospital's  DRGs.  First  of  all  what  we  see  is  the 
university  teaching  hospitals  (UT1,  UT2,  &  UT3)  look  similar  in  terms  of  the 
explanatory  power  of  severity.  There  are  about  20%  of  their  DRGs  where  severity 
isn't  explaining  much--the  homogeneous  DRGs.  Then  there  are  about  20%  that  have 
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from  20%  to  40%  explained,  20%  with  from  40%  to  60%  explained,  about  20%  with 
from  60%  to  80%  explained,  and  about  20%  of  their  DRGs  have  from  80%  to  100% 
variability  explained  by  severity.  It's  generally  what  we  see  in  university 
teaching  hospitals,  which  means  that  the  average  explanatory  power  is  around  50% 
in  terms  of  RIV  by  severity  within  DRGs.  Notice  how  different  the  patterns  are 
for  these  community  teaching  or  community  non-teaching  hospitals  (CT1,  CT2,  & 

C).  They  show  that  within  40%  to  50%  of  their  DRGs,  severity  has  no  explanatory 
power.  In  other  words,  they  are  not  seeing  a  spread  of  severity  in  the  same 
DRGs.  The  middle  range  is  very  condensed.  They  have  about  the  same  number  of 
DRGs  or  percentage  of  DRGs  that  have  between  80%  and  100%  of  the  variability 
explained  by  severity.  Figure  28  repeats  the  example  but  eliminates  all  DRGs 
that  had  less  than  eight  cases  in  them.  Notice  that  it  doesn't  change  much  for 
university  teaching  hospitals.  In  fact,  the  proportion  with  no  explanatory 
power  goes  down.  But  it  does  change  to  some  extent  for  these  community  hospi¬ 
tals.  What  we're  finding  is  for  the  teaching  hospitals,  sample  size  has  not 
made  that  much  difference  in  terms  of  the  explanatory  power.  But  for  the  other 
institutions  it  has  made  a  little  difference. 

Since  I've  shared  why  I  think  homogeneity  is  so  important  in  terms  of 
resource  use,  as  a  statistician  who  knows  that  you  can  lie  with  statistics,  I 
want  to  state  that  it  is  not  necessarily  a  fatal  problem  that  DRGs  have  a  spread 
of  resource  use  in  them.  In  other  words,  if  the  DRG  distribution  has  a  peaked 
normal  distribution  (with  most  of  the  patients  having  resource  use  around  the 
mean,  with  some  at  the  high  end  and  some  at  low  end,  but  most  of  them  are  around 
the  mean),  and  if  that's  the  way  all  hospitals'  data  looked  and  everyone  saw  the 
same  proportion  at  the  high  end  as  they  see  at  the  low  end,  then  having  a  DRG 
system  for  reimbursement  would  not  be  a  fatal  problem.  Because,  if  they  ever 
got  a  high  cost  case  they  would  eventually  get  a  low  cost  case  and  it  would 
balance  out  and  all  hospitals  would  be  equitably  treated.  The  next  examples 
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show,  unfortunately,  that  such  is  not  the  case.  We've  now  discovered  that 
there  can  De  very  bad  incentives  in  the  DRG  system.  One  of  them  was  described 
in  an  article  in  the  Wall  Street  Journal  back  in  February.  It  was  an  editorial 
on  how  some  hospitals  in  the  United  States  are  contracting  with  the  physicians 
in  their  area  to  bring  the  less  severely  ill  patients  into  those  institutions. 
The  patient  comes  in  and  uses  less  resources  than  what  the  DRG  reimbursement 
amount  is.  They  split  the  difference  with  the  physi ci an--the  physician  gets 
half  and  the  hospital  gets  half.  So,  even  if  the  patients  were  randomly  distri¬ 
buted  before,  with  those  kinds  of  incentives  being  allowed  in  the  system,  we 
know  in  the  long  run  they  won't  end  up  being  normally  distributed.  But  now  let 
me  indicate  how  different  hospitals  may  look  by  severity  level.  Figure  29  is  an 
example  of  three  university  teaching,  two  community  teaching  and  one  community 
non-teaching  hospital.  This  represents  the  proportion  of  patients  over  the 
whole  institution  that  are  Level  1,  the  proportion  that  are  Level  2's,  the  pro¬ 
portion  of  3 ' s ,  and  the  proportion  of  4's.  Notice  how  different  the  range  of 
proportions  of  Level  l's  are  even  within  the  teaching  hospitals  from  40%  up  to 
70%.  On  the  opposite  end  of  the  extreme  the  percent  of  3's  and  4's  seem  to  be 
quite  similar  here.  In  terms  of  community  hospitals,  in  fact,  we  see  a  sort  of 
inversion  where  this  community  non-teaching  hospital  has  the  lowest  percentage 
of  Level  1  patients  in  the  state,  and  the  community  teaching  hospital  (CT1)  has 
the  highest  percent--almost  90%  of  their  patients  are  Level  1  severity.  You  see 
dramatic  differences  in  the  proportions  of  Level  1  severity  and  also  not  quite 
as  dramatic  differences  in  the  proportions  of  Level  3's  and  4's,  but  realize  the 
financial  implications  of  those  higher  levels  of  seventy--one  percentage  point 
at  that  end  of  the  scale  can  mean  millions  of  dollars  for  a  hospital.  But 
that's  not  the  oniy  thing  you  have  to  look  at.  Because  it  turned  out  that  for 
UT2  one  of  the  reasons  they  may  have  had  so  few  Level  l's  is  that  they  have  no 
OB  service  and  no  pediatric  service.  Now  a  lot  nf  OB  and  pediatric  patients  are 
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Level  1.  So,  you  have  to  look  at  how  they  distribute  into  different  DRGs. 

That  we've  done  in  Figure  30.  We're  going  to  go  from  a  specific  case  to  a  more 

general  case.  DRG  148  is  Major  Small  and  Large  Bowel  Procedures.  We  see  data 

from  two  university  teaching  hospitals  and  two  community  teaching  hospitals. 
Notice  that  UT1  has  9%  Level  4 ' s  and  25%  Level  3's.  About  35%  Levels  3  and 
4  patients  are  accumulating  72%  of  their  dollars  so  they  have  great  financial 
implications  even  though  they  are  small  in  terms  of  numbers.  UT2  also  has  a 
disparity  in  terms  of  the  percentage  of  patients  versus  the  dollar  implication, 

having  only  18%  of  their  patients  at  Level  3  but  37%  of  their  charges.  So  you 

can  see  dramatic  differences  in  percentage  of  severity  across  different  types 
of  institutions  within  a  DRG.  You  might  ask,  "Susan,  did  you  pick  the  one  DRG 
that  happened  to  show  that  kind  of  a  difference?"  Let  me  first  tell  you  about 
one  hospital's  data  and  then  we'll  compare  it  with  other  hospitals.  First,  the 
question  is,  how  many  DRGs  have  that  problem?  One  data  set  from  a  university 
teaching  hospital  had  about  10,000  cases  going  into  441  DRGs  and  accumulating 
62  million  dollars  in  charges.  We  found  in  this  case  20%  or  83  DRGs  had  only 
one  severity  level  in  them.  Those  are  nice  and  homogeneous  with  respect  to 
severity.  Those  are  the  kind  we  want.  However,  we  found  that  they  had  almost 
none  of  the  hospital's  charges  for  the  patients  in  that  group  and  only  3%  of  the 
patients,  so  we've  got  a  number  of  homogeneous  DRGs;  it's  just  that  in  this 
institution  they  have  no  financial  imp! ications  —  almost  nobody's  in  them.  Most 
of  the  DRGs  had  two  levels  of  severity.  Forty-five  Descent  or  197  DRGs  had  only 
two  levels  of  severity.  They  affected  19%  of  the  charges  and  one-third  of  the 
patients.  In  combination,  64%  of  the  DRGs  in  this  institution  are  either  homo¬ 
geneous  )r  have  two  levels  of  seventy.  They  are  affecting  20%  and  about,  one- 
th’rd  of  tt.e  patients.  On  the  othe-"  hand,  93  and  68  DRGs,  respectively,  or  a 
total  of  36%  of  the  DRGs  have  three  or  four  levels  of  severity  in  them.  Those 
3rp  heterogeneous  with  -ns  pen  to  severity  but  they  are  the  minority  of  the 
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DRGs.  Unfortunately,  however,  for  this  hospital  they  are  affecting  80%  of  the 
charges  and  two-thirds  of  the  patients.  So  those  heterogeneous  DRGs,  even 
though  they  are  minorities  in  terms  of  their  numbers,  have  enormous  financial 
implications  and  lots  of  patients  in  them.  But  again  this  wouldn't  be  a  problem 
if  all  hospitals  looked  the  same.  But  I'm  sure  you  can  imagine  they  don't. 

Let's  see  how  that  looks  for  different  hospitals.  First  let's  look  at  severity 
levels  in  DRGs  for  two  university  teaching  and  three  community  teaching  hospi¬ 
tals.  (See  Figure  31)  Notice  that  in  UT1,  this  is  the  university  teaching 
hospital  we  just  discussed,  20%  of  the  DRGs  had  one  level  of  severity,  45%  had 
two,  21%  had  three  and  15%  had  four.  So  there's  a  total  of  36%  having  three  or 
four  levels  of  severity  compared  to  UT2  that  has  40%  of  their  DRGs  with  one 
level  of  severity  and  only  15%  with  three  or  four,  and  the  other  institutions 
having  50%  to  60%  of  their  DRGs  with  one  level  of  severity  and  only  10%  to  12% 
of  their  DRGs  with  three  or  four  levels  of  severity.  So  there  are  dramatic  dif¬ 
ferences  in  how  many  DRGs  and  the  proportion  of  those  DRGs  in  different  hospi¬ 
tals.  There  are  also  dramatic  differences  in  how  many  patients  are  affected. 

In  the  Figure  32  recall  UT1  with  63%  of  the  patients  in  those  DRGs  with  three 
and  four  levels  of  severity  compared  to  35%  for  UT2  and  CT1  and  20%  to  25%  for 
CT2  and  CT3.  Finally,  notice  the  dollars  at  risk  as  seen  in  Figure  33.  In  UT1 
there  are  80%  of  the  dollars  at  risk  in  those  DRGs  with  Severity  Levels  3  and  4, 
compared  to  50%  in  the  next  two  institutions,  and  40%  in  the  last  two  institu¬ 
tions.  You  can't  tell  where  differences  are  going  to  be  just  knowing  it's  a 
university  teaching  hospital  because  UT1  and  UT2  look  dramatically  different. 

The  community  teaching  hospital,  CT1,  looks  more  like  UT2. 

The  National  Cancer  Institute  noticed  something  I  really  had  never  seen  in 
looking  at  the  data.  They  were  specifically  interested  in  the  Cancer  DRGs  and 
ir  answering  the  question  that  maybe  some  of  you  have  read  about  that's  being 
raised  in  Congress  now  or  in  Washington  with  HCFA,  about  creating  a  DRG  471  for 
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cancer  protocol  patients  versus  nonprotocol  patients.  We  did  not  have  the  defi¬ 
nition  of  protocol  in  our  data  set,  but  we  were  able  to  look  at  the  cancer  DRGs 
by  severity  and  I  think  the  people  at  NCI  were  shocked  to  see  the  results.  We 
looked  at  the  DRGs  that  they  called  cancer  DRGs.  They  labeled  35  that  they  felt 
were  cancer  DRGs.  They  asked  what  did  a  cancer  center  see  in  the  spread  of 
severity  in  those  DRGs  compared  to  a  community  hospital.  So  we  took  four  com¬ 
munity  hospitals'  data  and  looked  at  the  maximum  number  severity  levels  in  each 
of  those  35  cancer  DRGs.  If  any  community  teaching  hospital  saw  a  spread  of 
severity  we  picked  it  up.  Then  we  looked  at  one  university  teaching  hospital 
that  had  a  cancer  treatment  center.  (See  Figure  34)  Only  in  one  of  the  35 
cancer  DRGs  did  the  university  teaching  hospital  see  only  one  level  of  severity. 
In  almost  60%  of  the  cancer  DRGs,  58%  to  be  exact,  they  saw  four  levels  of 
severity.  About  20%  of  the  DRGs  saw  two  levels  of  severity  and  about  20%  saw 
three  levels  of  severity.  Compare  this  to  the  community  hospital,  the  maximum 
in  any  of  our  community  hospital  data  sets,  where  50%  of  their  DRGs  showed  only 
one  severity  level  and  only  4%  showed  four  severity  levels.  Within  the  cancer 
group  in  particular  (UTCA,  CTCA),  we  have  a  much  worse  dichotomy  in  the  spread 
of  severity  of  DRGs  than  even  the  overall  hospital.  We  then  looked  at  the  55 
DRGs  that  they  said  could  possibly  have  cancer  in  them,  things  like  crainiotomy 
which  could  be  for  cancer  or  not.  The  "may  have  cancer"  (UT?CA,  CT?CA)  looks 
very  similar  to  the  "cancer"  in  terms  of  their  distribution  of  the  four  levels 
of  severity.  And  then  we  took  all  the  DRGs  in  terms  of  the  whole  hospital  to 
see  whether  cancer  was  showing  greater  spread  than  overall.  And  what  you  see  is 
there  really  are  dramatic  differences  in  the  cancer  DRGs  that  are  even  more 
striking  in  terms  of  the  proportion  of  DRGs  that  have  four  levels  of  severity  in 
them.  Now  that  the  NCI  people  have  seen  this  data,  they're  wondering  if  the 
protocol  issue  is  the  right,  issue  for  them  to  be  looking  at.  It  may  really  be  a 
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much  greater  problem  in  terms  of  the  spread  of  severity  and  could  have  much 
greater  financial  implications  than  the  protocol  or  nonprotocol  concern. 

Now  let's  discuss  some  internal  uses  of  severity  data.  None  of  the  hospi¬ 
tals  that  are  now  collecting  severity  data  across  the  United  States,  except  for 
Hopkins  in  its  experiment  with  the  Maryland  Cost  Review  Commission,  are  being 
reimbursed  on  the  basis  of  a  Severity  of  Illness  Adjustment.  They  are  col¬ 
lecting  this  data  more  for  internal  management  purposes.  We  have  looked  at 
physician  practice  patterns  within  a  hospital  to  see  which  physicians  are  prac¬ 
ticing  differently  from  their  colleagues,  controlling  for  severity.  We  analyze 
the  data  by  grouping  it  as  finely  as  I  know  how,  putting  the  patients  into  their 
DRG,  dividing  by  severity  and  dividing  by  procedure  type,  just  the  way  it  was 
done  in  those  DRGs  we  examined  previously.  This  gives  us  the  greatest  explana¬ 
tory  power,  between  70%  and  90%  for  each  hospital.  We  don't  want  to  add  physi¬ 
cians  here,  we  want  to  get  what  the  hospital  standard  is  by  DRG  severity  and 
procedure  type.  Then  we  take  each  physician's  patient  in  a  DRG  severity  and 
procedure  type  and  compare  the  resource  use  for  that  patient  to  the  norm  for  the 
whole  hospital.  The  solid  line  in  Figure  35  represents  the  zero  line,  so  any 
patient  that  is  treated  with  the  norm  for  the  whole  hospital  will  fall  on  that 
line.  If  they  fall  below  that  line  that  means  that  the  physician  has  used  less 
resources  than  the  norm  for  the  corresponding  DRG  severity  and  procedure.  Above 
the  line  means  more  resource  use  than  the  corresponding  DRG  severity  and  proce¬ 
dure  norm.  We  examine  those  deviations  for  total  charges,  for  routine  charges, 
for  lab,  for  radiology,  and  for  pharmacy.  We  can  look  at  a  physician's  practice 
pattern  and  see  how  he  differs  from  the  norm  for  each  of  the  respective  DRG 
severity  and  procedures  categories.  Thus,  we  can  put  all  of  his  patients  on  one 
graph  because  each  one  of  the  dots  is  compared  to  the  respective  DRG  severity 
and  procedure  norm.  Now,  looking  at  Physician  116,  who  treated  33  patients 
during  this  period,  notice  that  there  are  some  above,  but  most  of  them  are 


Figure  35 
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falling  below  the  line.  Whether  it's  Severity  Level  1,  2,  or  3,  he's  falling 
below  the  line  for  most  of  his  patients  in  terms  of  total  charges,  below  the 
line  in  terms  of  routine,  below  the  line  in  terms  of  lab,  around  zero  for 
radiology,  and  below  the  line  in  terms  of  pharmacy.  Overall  this  physician,  no 
matter  what  he's  treating,  tends  to  treat  his  patients  very  consistently  with 
wh3t  we  might  call  more  efficient  resource  use  than  his  colleagues  compared  to 
the  DRG  severity  and  procedure  category  into  which  that  patient  falls.  1  found 
this  kind  of  consistency  remarkable  in  the  beginning,  but  this  is  the  pattern  we 
see.  Physicians  practice  very  consistently.  Of  course  they  don't  all  practice 
consistently  low.  Physician  117,  shown  in  Figure  36,  is  an  example  oi  a  physi¬ 
cian  who  has  a  few  below  the  norm  but  most  of  them  are  high  in  terms  of  total 
charges--more  resource  use  than  his  colleagues.  Most  of  it  results  from  routine 
charges  being  higher.  Notice  the  pattern  for  lab,  radiology  and  pharmacy  is 
near  zero.  When  we  accumulated  data  for  all  41  patients,  they  accumulated 
almost  $147,000  more  than  we  would  have  expected  if  he  treated  them  at  the  norms 
for  the  hospital.  Thus,  this  physician  was  "costing"  the  hospital  $147,000.  It 
was  because  he  kept  the  patients  551  days  longer  than  the  norm,  resulting  in 
$112,000  more  in  routine  charges.  The  lab,  radiology,  and  the  pharmacy  charges 
were  just  plus  or  minus  a  few  thousand  dol 1 ars--real ly  within  noise  levels. 

This  physician  is  not  using  more  ancillary  support,  he's  keeping  his  patients 
longer  for  whatever  reason.  When  we  talk  to  these  people,  they  usually  say, 
"That's  what  I've  always  done,  I  like  to  keep  my  patients  around  a  little 
longer."  You  can  pick  it  out  very  clearly  with  this  adjustment.  That  again  is 
what  we  typically  see--incredible  consistency.  When  physicians  practice,  they 
practice  consistently  no  matter  what  DRG  or  severity  or  procedure  type  they  are 
dealing  with  and  it's  either  consistently  around  zero,  consistently  high,  or 
consistency  low.  Very  rarely  do  we  see  a  physician  who  varies  his  practice 
greatly  as  it  appears  Pnysician  40  in  Figure  37  does.  This  physician  appears  to 
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go  from  the  bottom  to  the  top  on  every  single  dimension.  When  we  first  found 
this  physician,  we  hypothesized  all  kinds  of  events  in  his  life  during  the 
course  of  the  collection  of  this  data  that  could  have  caused  this  to  happen.  We 
ultimately  went  back  to  the  hospital  and  they  identified  the  physician.  We 
never  know  the  identity  of  a  physician.  We  only  have  an  attending  physician 
code  attached  to  the  discharge  abstract  and  severity  data.  We  first  noticed  a 
very  large  number  of  cases  and  it  turned  out  that  this  physician  was  the  chief 
of  his  service  and  insisted  that  everybody  be  admitted  with  him  as  the  attending 
physician.  But  then  the  residents  took  over.  So  there  were  some  residents  who 
used  less  resources  and  other  residents  who  used  more  resources  and  the  ramifi¬ 
cations  for  the  hospital  were  quite  extensive.  Notice  these  403  patients  accu¬ 
mulated  $451,000  more  in  resource  use  than  we  would  have  expected  in  the  same 
DRG  severity  and  procedure  groups--not  because  of  length  of  stay,  interestingly 
enough,  which  is  only  126  days  over  for  403  patients,  or  about  a  quarter  of  a 
day  per  patient.  It  was  high  routine  charges,  $118,000,  because  they  stayed  in 
the  ICU  and  CCU  too  long;  $184,000  more  in  the  lab--they  were  doing  a  lot  of  lab 
testing;  $60,000  more  in  radiology  and  $30,000  more  in  pharmacy.  What  this 
hospital  learned  was  that  this  chief  was  not  monitoring  his  residents  very 
closely.  They  subsequently  hired  an  assistant  to  help  him  work  with  the  team 
and  they  also  started  working  with  the  residents,  looking  at  those  on  both  sides 
of  the  norm.  They  worked  particularly  in  the  areas  of  the  ICU  and  the  lab  to 
reduce  their  resource  utilization.  With  this  kind  of  adjustment  we  can  pinpoint 
when  there  are  differences  and  deviations  and  what  the  causes  are. 

You  might  say,  "Susan,  it  now  looks  exciting  but  we  can  do  that  with  DRGs 
too."  Let  me  share  with  you  what  some  of  our  data  has  revealed  because  this  is 
some  of  the  data  that  I  find  to  be  most  disconcerting.  Somehow  I'm  able  to 
separate  myself  from  the  situations  involving  an  individual  hospital  where 
severity  analysis  may  go  against  that  hospital,  because  I  see  a  hospital  as  an 
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entity.  But  I  have  trouble  with  this  data  when  I  see  the  results  by  physi¬ 
cian.  I  tend  to  think  more  of  physicians  as  people  I  work  with  and  human 
beings  and  I  know  what  it  means  when  you  are  criticized  and  when  that  criticism 
may  be  inappropriate.  The  example  of  physician  practice  patterns  in  Figure  38 
was  created  from  a  sample  of  10  physicians.  The  data  has  been  accumulated  in 
two  different  ways.  For  each  physician's  patients  we've  accumulated  their 
deviations  from  the  norm  where  the  norm  is  created  by  producing  DRG  severity  and 
procedure  groups,  the  best  ones  I  know.  This  is  represented  by  the  solid  dots. 
Then  we've  done  the  identical  calculations  but  only  used  the  DRG  as  the  basis 
for  the  norm,  not  controlling  for  severity  and  procedure.  That  is,  we  use  the 
DRG  average  as  the  standard  and  compare  that  physician  to  the  DRG  average. 

We've  accummulated  the  total  charges  in  each  of  these  ways  and  that's  what  is 
represented  on  the  graph  for  each  physician.  The  disconcerting  part  about  this 
data  is  that  often  you  get  a  very  different  picture  of  a  physician  when  you've 
controlled  for  severity  and  DRG  compared  to  DRGs  alone.  Physicians  4  and  5  look 
almost  identical  by  DRGs.  They  both  look  like  they're  using  slightly  more 
resources,  about  $14,000  more  (14305  and  14744),  than  their  colleagues  when  you 
compare  each  of  their  patients  to  the  DRG  norm.  But  for  Physician  4,  when  we 
control  for  severity,  we  actually  find  that  he  was  much  more  efficient  than  his 
colleagues  treating  his  patients  with  $32,000  less  in  resource  use  (32064). 
Whereas,  for  Physician  5,  who  looked  identical  to  him  in  DRGs,  when  we  control 
for  severity,  we  find  he  was  using  almost  $63,000  more  in  resources  than  his 
colleagues  (62583).  So  two  physicians  who  look  identical  in  DRGs  can  have  very 
different  results  when  we  control  for  severity.  Based  on  DRGs,  you  probably 
would  have  talked  to  Physician  7  saying  "Look,  you're  using  $58,604  more  than 
the  norm  for  your  patients,  what  are  you  doing  to  them?"  When  we  control  for 
severity,  however,  we  find  he  is  right  on  the  average,  only  $2,207  above. 


Physician  8,  by  DRGs  alone,  looks  in  fine  shape,  but  when  you  control  for 
severity,  you  find  out  that  he  is  really  overutilizing.  So  by  looking  at  only 
the  ORG  data  you  can  be  criticizing  or  praising  the  wrong  physicians.  Where 
that's  going  to  take  medical  practice  in  the  future,  I'm  not  sure.  But  I  think 
it's  something  that  we  have  to  be  very  cautious  about  using  this  kind  of  data. 

The  question  is  asked  as  to  how  the  above  results  may  be  tied  to  quality 
assurance  and  risk  management  procedures  within  the  various  facilities  that 
we've  worked  with  in  terms  of  those  physicians  that  have  higher  severity 
patients  possibly  due  to  quality  of  care  problems.  There  have  only  been  two 
hospitals  where  we've  looked  at  that  in  detai 1 --Hopkins  and  Stanford.  At 
Stanford  they're  having  their  utilization  review  personnel  collect  the  severity 
data  and  at  the  same  time  flag  those  cases  that  they  think  are  possibly  higher 
severity  due  to  quality  of  care.  I  understand,  though,  that  the  numbers  of 
those  cases  have  been  very  few.  In  terms  of  the  individual  physicians  they've 
looked  at,  they  have  not  come  back  and  said  that  these  kinds  of  differences  are 
due  to  quality  of  care  problems.  Of  course  the  physician  who  uses  $62,000  less 
by  severity  of  case  is  not  necessarily  a  winner.  He  could  be  a  physician  who  is 
just  not  doing  anything  for  his  patients.  Then  the  quality  of  care  would  come 
in  and  we'd  have  to  look  at  that  sort  of  situation.  We've  got  to  go  beyond  this 
in  terms  of  determining  the  reasons  for  some  of  these  deviations.  The  other 
possibility  is  that  you  may  have  a  person  who  keeps  his  patients  at  Levels  1  and 
2  by  a  more  costly  expenditure  by  predicting  complications,  watching  for  them, 
etc.,  and  he  may  actually  look  above  the  norm  for  the  level  of  severity  he's 
seeing.  But  when  we  look  at  his  pattern  of  practice  he  may  very  seldom  get  into 
avoidable  compl ications--high  quality  care.  But  all  of  that  has  to  be  looked 
into  and  this  is  what  the  individual  hospitals  themselves  do.  They  look  at  the 
quality  of  care  that  has  been  provided  to  these  patients  to  see  whether  quality 
might  possibly  be  one  of  the  explanations  for  these  differences.  To  date,  as  I 
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mentioned,  none  of  the  hospitals  have  come  back  and  said  they've  been  able  to 
explain  even  some  of  their  physicians'  performance  differences  by  quality  of 
care  issues.  It's  been  more  of  a  practice  pattern  situation  and  when  the  phy¬ 
sicians  see  it,  they're  able  to  adjust  their  practice. 

Many  of  these  questions  have  come  about  because  we've  found  that  some 
hospitals  have  higher  costs  per  case,  longer  lengths  of  stay  per  case;  we've 
found  that  some  physicians  have  higher  costs  per  case  and  longer  lengths  of  stay 
per  case;  and  when  we  ask  people  about  that  their  usual  answer  is  that  we  treat 
sicker  patients.  The  question  then  becomes  how  can  you  quantify  sicker  patients 
so  that  you  can  accurately  assess  what  is  attributable  to  differences  in 
severity  of  illness.  Someday  we'll  be  able  to  determine  what's  due  to  quality 
of  care,  but  we  don't  yet  have  that.  I  believe  that  if  we  could  use  a  severity 
based  system  it  could  be  very  useful  in  controlling  cost  in  a  prospective 
payment  system.  Many  people,  when  they  initially  see  this  kind  of  data  and  see 
the  various  resources  used  by  severity,  say,  "If  you  pay  more  for  a  sicker 
patient  isn't  that  going  to  cost  the  whole  system  more  money?"  I  believe,  used 
properly,  it  will  cost  the  whole  system  less  money  in  the  long  run,  and  Figure 
39  shows  the  four  reasons  why.  First,  if  we  equitably  reimburse  for  all 
severity  levels  we'll  reimburse  to  a  Level  1,  then  a  Level  2,  then  a  Level  3, 
and  then  a  Level  4  appropriately.  We  would  not  have  an  incentive  to  over  admit 
less  severely  ill  patients.  That's  a  problem  in  our  current  system.  I  was  at  a 
conference  yesterday  morning  with  people  involved  with  the  Medicaid  system. 

Every  Medicaid  patient  is  reviewed  in  the  State  of  Maryland,  but  they  are  still 
finding  increases  because  there's  a  large  gray  area  as  to  whether  you'd  admit  a 
patient  or  not.  If  you  want  to  find  reasons  to  admit  him,  you  find  reasons;  and 
if  you  do  not  want  to  admit  him,  you  find  reasons  not  to.  But  if  there  were 
no  incentive  to  admit,  say,  a  Severity  Level  1  patient,  you  wouldn't.  I  heard 
some  people  in  the  Administration  saying,  "Susan,  we  might  even  be  able  to  do 


Figure  39 

USES  OF  THE  SEVERITY  INDEX 
FOR  COST  CONTROL  IN  A 
PROSPECTIVE  PAYMENT  SYSTEM 


o  EQUITABLE  REIMBURSEMENT  FOR  ALL  SEVERITY  LEVELS 

No  incentive  to  over-admit  less  severely  ill  patients 
No  incentive  to  transfer  out  more  severely  ill  patients 


o  RATIONAL  CONTROL  OF  RESOURCES  BY  SEVERITY  LEVELS 
o  IDENTIFY  PHYSICIANS  WITH  ATYPICAL  PRACTICE  PATTERNS 
o  RESOURCES  DEVOTED  TO  MOST  SEVERELY  ILL  PATIENTS 

Cost  and  moral  issues  for  Severity  Level  4  patients 


what  some  HMOs  are  currently  doing  by  paying  their  physicians  more  to  treat 
patients  that  are  less  severely  ill  on  an  outpatient  basis."  There  are  several 
HMOs  I  know  of  around  the  country  that  are  paying  their  physicians  125%  of  their 
usual  fee  to  treat  certain  patients,  who  otherwise  might  be  admitted,  on  an  out¬ 
patient  basis  and  everybody  saves  money.  Also  there'll  be  less  incentive  to 
transfer  out  more  severely  ill  patients  if  the  hospital  could  be  appropriately 
reimbursed.  Again,  there  wouldn't  be  the  dumping  phenomenon  if  the  hospital 
could  take  care  of  them  and  be  reimbursed.  That  wouldn't  necessarily  save  money 
but  what  it  would  do  is  make  the  system  more  equitable. 

The  second  and  third  points  are  where  we're  going  to  get  our  big  savings-- 
rational  control  of  resource  use  by  severity  level  and  identification  of  physi¬ 
cians  with  atypical  practice  patterns.  We've  already  seen  the  extent  to  which 
this  point  could  have  an  effect  in  an  institution,  but  let  me  tell  you  a  little 
more  about  how  hospitals  have  used  their  severity  data  related  to  this  point. 

A  number  of  our  institutions  have  taken  their  severity  data  and  looked  at  it 
with  a  physician  group  that  had  a  desire  to  do  something.  One  example  comes 
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from  a  hospital  in  New  England  where  several  of  the  cardiothoracic  surgeons  went 
to  the  hospital  president  and  said  they'd  like  to  open  a  heart  and  liver 
transplant  program.  The  head  of  the  hospital  said  that  would  be  great  but  it 
costs  money  and  in  this  day  and  age  that's  not  possible.  But  he  said  maybe  they 
could  earn  the  money  to  do  it.  So  the  physicians  had  an  incenti ve--they  wanted 
something.  He  first  worked  with  them  and  said  "Do  you  like  the  way  the  DRGs  are 
defined  for  your  patients  and  what  you're  doing  for  them?"  They  decided  it 
wasn't  the  way  they  wanted  to  define  them.  They  redefined  the  DRG  groups. 
Different  procedures  were  grouped  together.  Then  they  divided  those  patients  by 
severity  because  they  were  collecting  severity  on  an  ongoing  basis.  So  they  had 
severity  data  on  these  patients  and  because  severity  has  no  hierarchy  they  could 
put  severity  in  these  new  groups  they  had  formed.  Then  they  pulled  the  patient 
records  and  a  team  of  surgeons  went  over  each  of  the  patients  in  groups. 

Severity  Levels  1,  2,  and  3  were  in  their  DRGs.  They  didn't  include  the  4's 
because  of  their  variability.  They  decided  in  every  single  case  that  there  were 
things  they  were  doing  that  they  could  do  better.  They  could  improve  on  the 
efficiency,  how  quickly  they  were  doing  it,  and  some  of  the  tests  they  found 
they  really  didn't  need.  So  as  a  team  they  made  these  decisions.  Then  they 
tried  it  out  for  three  months  to  be  sure  that  it  wasn't  hurting  the  quality  of 
care  in  any  way.  Once  they  saw  that  they  could  do  this  in  a  stable  way  in  terms 
of  how  they  were  all  treating  patients  at  a  similar  severity  level  similarly, 
they  went  to  some  of  the  local  HMQs  and  were  able  to  make  contracts  with  the 
HMOs  agreeing  to  treat  their  patients  on  a  contract  basis  for  a  fixed  price  per 
new  DRG  and  severity  level.  They  set  the  price  in  advance  as  long  as  the  HMO 
agreed  to  send  their  patients  to  the  hospital.  Thus,  the  hospital  agreed  to  be 
at  risk  because  if  the  patients  got  sicker  because  of  poor  quality  of  care  the 
hospital  would  be  reimbursed  at  the  lower  level  of  severity.  They  were  able  to 
become  the  most  economical  place  to  go  for  this  kind  of  surgery,  because  they 


analyzed  very  carefully  what  they  were  doing  with  these  groups.  With  the  money 
they  earned  from  treating  these  patients  they  were  able  to  fund  their  heart  and 
liver  transplant  program.  The  physicians  are  thrilled  and  the  hospital  is 
thrilled.  Now  the  orthopedic  surgeons  are  doing  something  similar  in  the  ortho¬ 
pedic  area.  The  physicians  in  medicine  are  now  getting  together  to  see  how,  by 
severity  level,  they  can  rationally  examine  what  they're  doing.  In  the  past, 
with  DRGs  that  ranged  from  $1,000  to  $200,000,  they  couldn't  tell  where  they 
were  efficient,  where  they  were  inefficient,  or  what  they  were  doing  differently 
because  of  the  types  of  patients  that  were  in  there.  I  believe  that  can  save  us 
a  great  deal  of  money  because  I  know  there  are  things  we’re  doing,  and  I'm  sure 
all  of  you  know  there  are  things  that  are  being  done,  that  if  we  looked  at  them 
more  rationally  could  be  eliminated. 

Finally,  the  resources  devoted  to  the  most  severely  ill  patients,  the  Level 
4's  in  our  system,  can  be  examined.  You  can  see  in  the  data  I  shared  with  you 
today  there  are  moral  issues  involved  with  those  patients.  Currently  they  are 
hidden  within  DRGs  but  I  think  this  is  a  group  of  patients  that  we  should 
really  look  at  separately  because  they  are  very  expensive.  The  question  is, 
when  we  have  limited  resources,  what  should  we  be  doing--what  are  we  getting  as 
a  result  of  this  resource  use.  It's  something  that  I  think  we're  going  to  need 
to  examine  as  a  society.  So  for  these  reasons,  I  believe  by  controlling  for  how 
sick  the  patients  are  and  getting  more  homogeneous  groups  we  can  save  money 
in  reimbursing  hospitals  for  the  delivery  of  health  care. 


CASE  MIX:  IMPLICATIONS  FOR  MANAGEMENT 


Forrest  W.  Graves,  Ph.D. 

Utilization  of  hospital  resources,  hospital  management,  market  share, 
financial  stability,  capital  formation,  and  a  hospital's  organizational  goals 
and  primary  mission  are  but  a  few  of  the  key  elements  in  the  strategic 
planning  process.  Strategic  planning  in  health  care  is  moving  hospital 
administrators  further  away  from  a  preoccupation  with  routine  operational 
issues  and  focusing  their  attention  on  issues  concerned  with  past  performance 
and  predicting  future  trends  in  new  technologies  and  clinical  program  via¬ 
bility.  As  a  technique  for  managing  changes  in  health  care  and  long-range 
hospital  planning,  the  strategic  planning  process  is  especially  suited  to 
address  the  need  of  hospital  managers  and  medical  staff  to  keep  abreast  of 
the  inevitable  changes  resulting  from  a  prospective  hospital  cost-based 
reimbursement  system  with  its  emphasis  on  cost  containment. 

One  of  the  specific  objectives  of  strategic  planning  in  health  care 
today  is  to  assist  hospital  managers  in  defining  the  hospital's  product. 
Merged  clinical  and  financial  records  for  a  specific  incident  of  care  as 
defined  by  Diagnosis  Related  Groups  or  some  alternative  clinical  program 
categories  will  provide  information  to  drive  the  strategic  planning  process 
and  allow  hospital  managers  to  analyze  case  mix,  monitor  admissions,  assess 
averge  length  of  stay,  and  manage  physician  practice  patterns.  These 
activities  are  viewed  as  prerequisites  for  determining  which  products  are 
winners  and  which  are  losers  and  will  help  guide  efforts  at  managing  changing 
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Use  of  Comparative  Data  in  Hospital  Management 

Sound  hospital  management  demands  in-depth  and  thorough  information 
about  utilization  that  accurately  reflect  the  complexities  of  patient  care. 
Dramatic  variations  in  admissions,  length  of  stay,  resource  consumption,  and 
ancillary  utilization  impact  decision-making  and  influence  the  direction  of 
the  hospital's  strategic  planning  process.  To  accurately  assess  hospital  and 
physician  performance,  and  to  further  define  the  hospital's  product  line, 
hospital  managers  must  have  national  comparative  data  that  account  for  these 
variations. 

Successful  strategic  planning  depends  on  the  ability  of  hospital  managers 
to  turn  available  data  into  information  necessary  for  monitoring  utilization. 

The  success  of  a  strategic  plan  is  dependent  upon  the  availability  of  reliable 
information  about  the  hospital's  internal  operations  and  the  hospital's  share 
of  the  health  care  marketplace.  A  comprehensive  comparative  data  base  pro¬ 
vides  this  valuable  information.  Utilized  as  an  integral  part  of  a  hospi¬ 
tal's  overall  information  system,  hospital  managers  can  use  such  data  to: 

*  Develop  a  credible  business  plan  based  on  case  mix. 

*  Define  and  measure  the  hospital's  "products." 

*  Monitor  changes  in  hospital  utilization  and  physician  practice  patterns. 

*  Relate  changes  in  patient  mix  to  changes  in  medical  staff  activity  or 
composition. 

*  Bridge  the  gap  between  operating  data  and  management  information 
requirements. 

*  Assess  market  share  and  competition. 

Comparative  data  allow  the  hospital  manager  to  monitor  changes  in 
utilization  within  the  hospital  over  time  ( longitudinal  analyses)  and  to 
compare  the  hospital  with  its  "peers"  at  any  given  point  in  time  (cross- 
sectional  analyses).  Criteria  for  selecting  peer  hospitals  is  largely 
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determined  by  the  objectives  of  the  hospital  manager's  strategic  plan,  but 
should  take  into  account  the  need  to  develop  measures  which  assess  hospital 
performance,  case  mix  complexity,  and  resource  utilization. 

Summary 

A  successful  strategic  management  plan  is  dependent  on  comparative  data 
that  will  enable  hospital  managers  to  make  accurate  assessments  of  hospital 
utilization  and  use  this  information  to  direct  the  institution.  To  remain 
viable  in  a  volatile  health  care  market,  effective  hospital  management 
implies  the  availability  of  comprehensive  and  accurate  comparative  infor¬ 
mation  about  hospital  utilization.  While  this  is  only  one  element  in  the 
overall  strategic  planning  process,  hospital  utilization  information  will 
enable  hospital  managers  to  evaluate  patterns  of  resource  within  their  own 
institutions  and  assess  utilization  changes  across  hospitals  which  are 
similar  to  theirs  in  location,  size,  and  patient  mix. 

The  complexities  of  hospital  management  require  a  strategic  planning 
approach  to  assessing  changes  in  utilization  especially  in  light  of  con¬ 
tinuing  changes  in  methods  of  defining  the  hospital's  products.  Comparative 
data  are  available  which  allow  flexibility  in  defining  a  hospital's  peer 
institutions  and  provide  year  by  year  comparisons  in  changing  patterns  of 
hospital  utilization.  Use  of  this  comparative  information  should  be  included 
as  an  integral  part  of  a  hospital's  overall  information  system  for  strategic 
planning  and  management. 


The  following  pages  contain  in  outline  form  the  substance  of  Dr.  Graves'  presen¬ 
tation  to  the  conference.  The  material  was  organized  around  the  framework  given 
in  Figure  1. 
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INFORMATION  FOR  DECISIONMAKERS  . 


INTERPRETATION  AND  ANALYSIS 


SPECIFICATION 
AND  TESTING 
OF  ANALYTICAL 
FRAMEWORK 


OPERATIONALIZATION  OF  CONCEPTS 


THEORETICAL  CONCEPTS 
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Figure  1. 

Framework  for  a  Hospital  Information  System 

Adapted  from:  James  T  Bonnen.  “Improving  Infor¬ 
mation  on  Agriculture  and  Rural  Life,”  American 
Journal  of  Agricultural  Economics,  Vol.  57,  No.  5, 
December,  1975  p.  758. 


REALITY 


The  Hospital  Must  Deliver  To  Its 
Patient  Mix  Quality  Care  At  A  Cost 
Below  Reimbursement 


THEORETICAL  CONCEPTS 
Product  Definition  (Clinical  Programs) 

Complexity 
Costliness 
Treatment  Patterns 

Treatment  Effectiveness  (Quality  of  Care) 

OPERATIONALIZATION  OF  CONCEPTS 

How  you  operationalize  your  theoretical  concepts  will  determine 
the  data  elements  you  need  in  your  data  base. 

To  operationalize  a  concept  means  to  put  the  concept  in  a 
form  that  permits  some  kind  of  measurement. 

DATA  REQUIREMENTS 

Availability  of  data  in  addition  to  the  UHDDS  elements: 

Care  unit  utilization 
Patient  specific  charges 
Hospital  Service  or  Clinical  Programs 
Physician-admitting,  consulting,  attending 


no 


COMPLEXITY 

The  concept  of  "complexity"  lacks  a  precise  definition.  Before 
hospital  case  mix  complexity  can  become  a  useful  factor  in  under¬ 
standing  hospital  costs,  a  clear  definition  of  the  concept  of 
complexity  must  be  established  and  an  operational  means  of  mea¬ 
suring  case  mix  developed. 

SEVERITY  OF  ILLNESS 

The  relative  level  of  loss  of  function  and  mortality  normally 
caused  by  a  particular  illness. 

RESOURCE  INTENSITY 

The  relative  volume  and  types  of  diagnostic,  therapeutic 
and  bed  services  used  in  the  management  of  a  particular  illness. 

ALTERNATIVE  PRODUCT  DEFINITIONS 
MDCs  and  DRGs 

CPHA's  Cross-Classification 
SysteMetrics '  Disease  Staging 
Severity  of  1 1 lness 
Patient  Management  Categories 
Or  A  Combination  Of  The  Above 


Grouping  Patients 


ABC 


All  of  your  hospital's  patients  art  aaignad 
to  the  appropriate  group  baaed  upon  each 
patient's  principal  diagnosis. 

The  example  dtows  one  possible  distribution. 


Your  Hospital's  Data 


Complexity  Index 


A 

(Case  Mix  Index) 

B  c 

Your  Hospital 

3oJ 

1700 

252  3500 1  457.  8000 

National 

4021 

i?ooj 

30?l  3500  I  307! 8000 

Since  the  Caea  Mix  Complexity  Index  is  a 
comparison  of  your  patient  distribution 
to  some  standard  or  norm  (here  the 
national  data  base),  only  your  hospital's 
proportions  are  used.  Your  hospital's 
costs  are  not  relevant  to  this  determination. 


CASE  MIX  COMPLEXITY  INDEX 

YOUR  HOSPITAL-^ 

30%  X  1700  +  25?  X  3500  -  45 1  X  8000 
402  X  1700  ♦  302  X  3500  *302  X  8000 

NATIONAL—*' 

510  •»  875  +  3600 
680  *  1050  2400 


The  calculations  are  shown.  The  numerator 
in  sach  instance  is  your  hospital's  data  (pro¬ 
portion  of  patients),  and  the  denominator 
utilizes  the  national  bass.  The  national  data 
base  average  cost  (weight)  per  group  Is  used 
in  both  numerator  and  denominator. 

The  Complexity  Index  calculated  for  your 
hospital  is  higher  than  most  of  those  listed 
In  the  September  30  issue  of  the  Federal 
Register. 


National  Data  last 


Sm  «h#  r*m  (id*  forth*  e*lcui*tionB. 


C*lcul£tln^_Your_F^*£lt£lJi^_E£ncl«nc2_JP£rfo™iin£«i)__Ind«i 


EFFICIENCY  (PERFORMANCE)  INDEX 
(CASE  MIX  ADJUSTED) 

COST  INDEX=  ■•ACTUAL  COST - 

EXPECTED  COST 

CHARGE  INDEX=  ACTUAL  CHARGES 
EXPECTED  CHARGES 

LOS  INDEX=  -ACJU.AL .  LOS _ 

EXPECTED  LOS 

FATALITY  I NOEX  »  -ACTUAL ..DEATHS _ 

EXPECTED  DEATHS 

ETC.  ETC. 


Th«  Efficiency  (Perforaance)  Index 
la  a  calculation  of  a  hospital 'a 
actual  perforaance  for  their  petlenta 
aa  compared  to  the  expected  perfor¬ 
aance  (suggested  etenderd  or  a  norm 
derived  froa  a  data  baae)  for  the 
eaae  groups  of  petlente. 


EFFICIENCY  (PERFORMANCE)  INDEX 


B 

Your  Hospital 


302 

|  1600 

252 

4000 

|  45  2 1 

|  9000  | 

National 

m 

1700 

257. 

3500 

1  45*l 

8000 

The  proportion  of  patients  la  your 
hospital's  and  la  aultlplled  by 
both  your  hospital's  perforaance 
(cost  weights)  and  the  perforaance 
of  the  data  base,  e.g.,  the  Medicare 
MEDPAJt  file. 


EFFICIENCY  (PERFORMANCE)  INDEX 
YOUR  HOSPITAL-^ 

30t  X  1 600  ♦  251  X  4000  *  451  X  9000 
30Z  X  1700  ♦  25Z  X  3500  ♦  451  X  60C0 

NATIONAL**'* 

480  ♦  1000  »  4050 
510  ♦  875  ♦  3600 

5530 

4905*1.1093  YOUR  HOSPITAL’S  I NDEJ( 


This  panel  displays  the  calculations. 
The  nuaerator  Is  your  hospital's  data 
(coat  figure)  and  the  denominator 
again  utilises  your  hospital 'a  pro¬ 
portions  of  patients  aultlplled  by 
the  national  coet  figure. 

Values  greater  than  1.00  are  preeuaa- 
bly  "bad."  Huabare  below  1.00  are 
presumably  "good.”  Values  froa  e 
selected  peer  group  of  hospitals  or 
othsr  sources  can  be  substituted 
for  the  national  data  baee. 
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HOSPITAL  COMPLEXITY  INDICES 


RN1  ,  HOSP .EXPECTED  AVERAGE  TOTAL  CHARGE*  PER  PATIENT 

NATIONAL  EXPECTED  AVERAGE  TOTAL  CHARGE*  PER  PATIENT 

*SPC  DATA  BASE  1976-1980 


PCI  .  HOSP  EXPECTED  AVERAGE  TOTAL  COST*  PER  CASE 

EXPECTED  AVERAGE  TOTAL  COST*  PER  CASE  ACROSS  PEERS 

*1982  SCHEDULE  OF  REIMBURSEMENT  RATES  FROM  NEW 
JERSEY  COST  INFORMATION  FOR  467  YALE  DRGs  (BY 
TEACHING  STATUS) 


ANC  .  HOSP  EXPECTED  AVERAGE  NURSING  COST*  PER  DIEM 

EXPECTED  AVERAGE  NURSING  COST*  PER  DIEM  ACROSS  PEERS 

•NEW  JERSEY  NURSING  -COST  FOR  67  ICD-9-CM  BASED  YALE 
DRGS 


icu-ccu  -  HOSP  EXACTED  AVERAGE  ICU-CCU_BAYS1.PER.CASE _ 

EXPECTED  AVERAGE  ICU-CCU  DAYS*  PER  CASE  ACROSS  PEERS 


•NATIONAL  PATIENT  SAMPLE  FILE  FOR  CY  1981  FOR  467  YALE 
DRGs  (WEIGHTS  FOR  TOTAL  PATIENTS  AND  PATIENTS  65*) 


EkD  .  HOSP  EXPECTED  AVERAGE  FATALITY  RATES* _ 

EXPECTED  AVERAGE  FATALITY  RATES*  ACROSS  PEERS 

•NATIONAL  PATIENT  SAMPLE  FILE  FOR  CY  1981  FOR 
467  YALE  DRGs  (WEIGHTS  FOR  TOTAL  PATIENTS  AND 
PATIENTS  65+) 


COMPARATIVE  DATA 


Longitudinal  -  your  hospital  historically 
Cross-sectional  -  your  hospital  compared  to  other  hospital 

ISSUES  TO  CONSIDER  IN  DEVELOPING 
A  COMPARATIVE  DATA  BASE 

Coverage 

Representativeness 

Data  quality-edit  checks  and  corrections  cycle 

Methodology-patient  exclusion  criteria 

Classif ication-meaningfulness  of  the  patient 
groupings  used 

APPLICATIONS  OF  COMPARATIVE  DATA 
Define  and  Measure  Hospital's  Product 
Monitor  Case  Mix  for  Changes  in  Type  of  Severity 
Capital  Formation 
Rate  Review 
Physician  Management 
Utilization  Control 
Reimbursement  Appeals 


US  ARMY  MEDICAL  TREATMENT  FACILITIES  VS.  PAS  HOSPITALS 


FYE  1982  FYE  1983 
Fiscal  Period 


EXPECTED  PATIENT  COSTLINESS  INDEX 


Peer  Hospital  Sample 
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FACTORS  AFFECTING  DRG  PROFITABILITY 

A.  ORG  Phenomenon 

1.  Research  Methodology 

a.  Purpose:  To  identify  impact  of  billing  data  on  DRG  profitability. 

b.  Approach: 

(1)  DRG  profit  and  loss  statements  from  case  mix  system  at  CPHA 

(2)  Use  of  nine  hospitals,  over  20,000  cases 

(3)  Medicare  only 

2.  "Profitability"  Defined 

a.  Profit  =  Reimbursement  -  Cost 

b.  Reimbursement  is  Medicare  payment  for  the  DRG 

c.  Cost  is  cost  of  covered  services 

3.  Potential  Problems  in  DRG  Rates 

a.  Use  of  billing  data  instead  of  medical  record  data 

b.  Incomplete  data  for 

(1)  Multiple  procedures  (Cardiac  Caths) 

(2)  Comorbidities  -  Conditions  brought  into  hospital  by  patient 

(3)  Complications  -  Conditions  developed  at  the  hospital 

c.  80%  of  hospitals  increased  case-mix  index  with  medical  record  data 

d.  Impact  may  be  creation  of  "winners'  and  "losers" 

4.  Comorbidity/Complications  (C.C. )  Adjustments 

a.  2,800  diagnoses  identified  as  C.C.  resulting  in  one  extra  day's 
stay  in  75%  of  cases 

b.  DRGs  recognized  that  C.C.  cases  would  consume  more  resources 

c.  Equated  geriatric  cases  to  C.C.  (e.g.,  over  70  =  respiratory 
failure) 

d.  No  identification  of  impact  of  multiple  comorbidities,  e.g., 
respiratory  failure,  kidney  failure,  other  body  systems  failure 

e.  Original  DRGs  used  LOS  rather  than  cost  to  identify  C.C.  relevance 

f.  208  DRGs  use  C.C.  to  define  DRG 


COMPARISON  OF  PROFITABILITY 
OF  COMOR BID/COMPLICATED 
V.S.  NONCOMORBID  DRG  PAIRS 
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THREE  DRGs  WHERE  COMQRBIOITIES  GIVE  LESS 
MONEY  THAN  NONCOMORBID  DRGs 


WEIGHT 

REIMBURSEMENT 

168 

Mouth  OP  w/C.C. 

.8631 

2,589 

169 

Moutn  OP  w/o  C.C. 

.8992 

2,698 

.  Difference 

“0091 

403 

Lymphoma,  Leukemia  w/C.C. 

1.1715 

3,515 

404 

Lymphoma,  Leukemia  w/o  C.C 

1.1787 

3.536 

Difference 

on 

452 

Complications  of  Treatment 

w/C.C.  .8492 

2,548 

453 

Complications  of  Treatment 

w/o  C.C.  .9020 

2,706 

Difference 

(158) 

DRGs  WHICH  PAY  MORE  FOR  LESS 

DRG  rates  were  statistically  derived  view  of  physician  behavior 

and  patient  condition 

Will  not  necessarily  provide  incentives  for  doing  more  work 


TWO  ORGs  WHICH  PAY  MORE  FOR  DOING  LESS 

DRG  411  History  of  Cancer,  No  Endoscopy 

Weight  .7221 

Approximate 
Reimbursement  $2,166 

DRG  412  History  of  Cancer  With  Endoscopy 

Weight  .3400 

Approximate 
Reimbursement  $1,020 


By  doing  Endoscopy,  Hospital  loses  $1,146 
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B.  Patient  Condition 

1.  Sicker  patients  require  more  resources 

2.  DRG  payment  does  not  take  condition  of  patient  into  account. 

3.  Therefore,  hospitals  treating  sicker  patients  will  probably  not 
fare  as  well  as  hospitals  treating  patients  who  aren't  as  ill. 

C.  Physician  Practice 

1.  Assumption:  Given  uniform  patients,  physicians  who  treat  cases  more 
efficiently  with  most  appropriate  mix  of  resources  will  be  more 
profitable. 

2.  Translation: 

a.  Minimize  testing 

b.  Shorten  length  of  stay 

c.  Develop  more  efficient  mix  of  diagnostic  and  treatment  protocols 

d.  Review  drug  usage 

D.  Impact  of  Coding  on  DRG  Profitablity 

1.  Need  for  accurate  coding 

2.  Need  for  specificity 

3.  Need  to  watch  for  Comorbidity/Complications 

4.  Need  to  identify  all  relevant  diagnoses 

E.  Accounting  Practices 

F.  Base  Rate  Determination 
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PERFORMANCE  MEASUREMENT  —  WHERE  CHAMPUS  FITS 

LTC  Joseph  C.  H.  Smith,  MSC,  USAF 
Dianne  K.  Reyer 

INTRODUCTION 

The  purpose  of  this  presentation  is  to  provide  you  with  some  sense  of  the 
impact  of  the  performance  measure  used  in  the  Direct  Care  System  on  CHAMPUS,  and 
to  provide  a  set  of  objectives  for  developing  a  performance  measure  which  would 
meet  the  needs  of  OCHAMPUS. 

NATURE  OF  THE  SYSTEM 

CHAMPUS :  The  Office  of  the  Civilian  Health  and  Medical  Program  of  the 
Uniformed  Services  (OCHAMPUS)  is  a  worldwide  program  that  provides  financial 
assistance  to  eligible  beneficiaries  for  health  care  received  from  civilian 
sources.  The  CHAMPUS  eligible  population  consists  of  active  duty  dependents, 
retirees  and  their  dependents,  and  surviving  dependents  of  deceased  members  of 
the  services.  CHAMPUS  eligibility  for  retirees  and  their  dependents  is  ter¬ 
minated  upon  attaining  eligibility  for  MEDICARE.  CHAMPUS,  however,  is  not  a 
total  health  benefit  program.  It  was  designed  to  augment  medical  services 
available  through  the  military  treatment  system.  This  results  in  a  two-part 
health  benefit  delivery  system  with  many  adverse  incentives  and  unique 
problems . 

Health  Benefit  Delivery:  This  two-part  system  must  provide  a  health  benefit 
to  the  entire  beneficiary  population.  The  direct  care  part  of  the  system  provi¬ 
des  the  total  health  benefit  to  active  duty  beneficiaries,  and  provides  care  on 
a  space  available  basis  to  the  remaining  beneficiaries.  In  the  event  that  the 
direct  care  system  cannot  provide  care  to  the  active  duty  member,  or  in  the 
event  of  required  emergency  care,  the  Direct  Care  System  must  buy  the  care  from 
the  private  sector,  or  from  other  government  agencies  such  as  the  VA.  On  the 


other  hand,  if  care  cannot  be  provided  to  other  beneficiaries,  the  beneficiaries 
may  be  sent  to  receive  care  from  civilian  sources  under  CHAMPUS.  Non-active 
duty  benef ici ar ies  may  choose  to  receive  outpatient  care  under  CHAMPUS  without 
first  going  to  an  MTF,  and  if  they  live  outside  an  MTF  catchment,  they  may 
choose  CHAMPUS  for  inpatient  care  also. 

In  summary,  only  the  active  duty  beneficiary  is  guaranteed  his  total  health 
benefit  from  a  single  source.  All  other  beneficiaries  must  rely  on  available 
space  in  the  Direct  Care  System,  and  use  CHAMPUS  otherwise.  Both  the  Direct 
Care  System  and  CHAMPUS  must  purchase  care  from  the  private  sector. 

F inancing :  The  split  between  the  two  systems  is  further  exacerbated  by 
the  fact  that  each  piece  has  a  separate  appropriation,  and  can  only  spend  its 
respective  budgets  in  certain  ways.  CHAMPUS  generally  can  only  use  CHAMPUS 
funds  to  purchase  care  from  civilian  sources.  The  funds  cannot  buy  care  in 
other  Federal  facilities,  nor  can  the  funds  be  transferred  to  the  Direct  Care 
System  to  expand  in-house  capability.  Because  of  this  separate  funding,  care 
rendered  in  the  Direct  Care  System  is  free  from  the  CHAMPUS  perspective,  and 
care  rendered  under  CHAMPUS  is  free  from  the  Direct  Care  System  perspective. 

For  care  provided  under  the  CHAMPUS  program,  the  beneficiary  generally 
shares  in  the  cost  of  that  care.  The  fact  that  the  beneficiary  shares 
in  the  cost  complicates  the  problem  since  bringing  the  care  in-house  may 
result  in  increased  cost  to  the  government. 

Another  complication  in  the  financing  arena  is  other  health  insurance. 
CHAMPUS  is  second  pay  to  all  other  health  insurance  except  supplemental  plans. 
This  means  that  if  the  beneficiary  has  other  health  insurance,  the  other  insur¬ 
ance  must  pay  first,  and  CHAMPUS  will,  to  a  certain  extent,  pay  the  remaining 
bill.  This  process  is  known  as  coordination  of  benefits  (COB),  and  again  re¬ 
duces  the  government's  liability  for  health  care  costs.  The  Direct  Care  System 
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does  no  coordination  of  benefits  at  this  time;  however,  there  are  proposals  be¬ 
fore  Congress  which  would  require  the  Direct  Care  System  to  coordinate  benefits. 

PERFORMANCE  MEASURES 

Defined:  As  used  in  this  paper,  performance  measures  are  values  which 
represent  the  output  or  ratios  of  outputs  from  and  inputs  to  a  system.  For 
example,  the  number  of  admissions  per  beneficiary  per  year  would  be  a  perfor¬ 
mance  measure  for  the  Direct  Care  System.  Another  example  would  be  the  number 
of  outpatient  visits  per  physician  per  year.  On  the  other  hand,  the  number  of 
physicians  in  the  system  would  not  be  considered  a  performance  measure,  but 
rather  a  measure  of  resources  or  input  to  the  system.  However,  it  is  clear  that 
if  one  is  to  discuss  ratios  of  outputs  and  inputs  as  performance  measures,  one 
is  vitally  concerned  with  the  measurement  of  inputs  in  order  to  develop  perfor¬ 
mance  measures. 

Objectives :  The  primary  reasons  for  developing  performance  measures,  and 
going  to  the  trouble  of  collecting  and  storing  the  measurements,  is  to  be  able 
to  manage  the  system  better.  Performance  measurements  as  discussed  here  are  to 
be  used  to  compare  the  performance  of  one  facility  with  another,  and  to  compare 
the  performance  of  the  MHSS  with  other  health  care  delivery  systems. 

Furthermore,  they  are  to  be  used  as  the  basis  for  budgeting  and  resource  alloca¬ 
tion.  The  purpose  of  this  paper  is  to  provide  objectives  for  performance 
measures  for  the  MHSS  which  would  meet  the  needs  of  OCHAMPUS.  The  proposed  list 
of  objectives  is: 

(1)  The  performance  measures  must  be  able  to  be  implemented.  The  ideal 
measures  of  performance  in  the  medical  system  are  likely  to  involve  such 
intangibles  as  health  status  and  beneficiary  satisfaction.  While  these  attribu¬ 
tes  are  desirable,  it  is  extremely  difficult  to  develop  reliable  or  valid 
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approaches  to  measuring  these  variables,  and  measures  which  are  developed  are 
likely  to  be  very  costly  to  collect.  One  must  accept  compromises  in  order  that 
we  can  actually  do  the  required  measurement. 

(2)  The  performance  measures  must  establish  an  incentive  structure  which 
promotes  the  choice  of  treatment  plans  which  is  most  cost  effective  to  the 
government.  Under  the  current  system  of  using  the  Composite  Work  Unit  (CWU)  as 
the  primary  measure  of  system  output,  the  incentive  is  to  provide  more  care. 

The  incentive  in  the  fee-for-service  system  in  the  private  sector  has  been 
blamed  to  a  large  extent  for  the  high  increases  in  the  cost  of  health  care  in 
this  country,  and  is  the  primary  system  affecting  the  utilization  of  CHAMPUS. 

For  example,  the  Military  Health  Services  System  (MHSS),  the  combined  CHAMPUS 
and  Direct  Care  System,  produces  1,084,535  admissions  per  year,  and  7,102,163 
hospital  days  per  year.  This  level  of  output  is  impressive;  however,  it  is  pro¬ 
bably  signif icantly  more  output  than  is  really  necessary  to  provide  a  high 
quality  health  benefit  to  our  population.  Specifically,  if  one  projects  annual 
utilization  for  our  population  enrolled  in  a  Kaiser  type  health  maintenance 
organization.  One  gets  a  requirement  for  only  665,840  admissions  and  3,343,556 
hospital  days.  That  means  that  we  are  providing  60%  more  admissions  than  really 
necessary,  and  more  than  double  the  number  of  required  hospital  days. 

(3)  The  performance  measures  must  provide  the  incentive  for  the  system  to 
rearrange  itself  in  order  to  improve  the  overall  efficiency.  Under  the  current 
system  of  separate  budgets,  if  the  MTF  Commander  is  faced  with  the  choice  of 
spending  $25,000  from  his  supplemental  funds,  or  sending  the  patient  to  CHAMPUS 
for  a  government  cost  of  $30,000  from  the  CHAMPUS  budget,  he  is  most  likely  to 
choose  the  CHAMPUS  option.  A  performance  measure  by  itself  cannot  solve  this 
problem,  but  it  could  serve  to  make  people  more  aware  of  the  problem,  and  pro¬ 
mote  the  appropriate  changes. 
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(4)  The  performance  measures  should  consider  all  inputs  into  the  system, 
and  all  outputs  from  the  system.  They  should  allow  one  to  look  at  the  various 
segments  of  the  system,  or  at  the  system  as  a  whole.  Certainly  the  cost  of 
CHAMPUS  is  an  input  to  the  system,  and  the  care  delivered  under  CHAMPUS  is  an 
output  from  the  system.  Both  should  be  considered  when  looking  at  the  total 
system.  At  the  same  time,  the  Direct  Care  System  has  multiple  missions.  It 
must  be  prepared  for  mobilization,  it  must  provide  a  health  benefit,  and  it  must 
support  the  requirements  of  the  line.  Some  activities  of  the  MTF  may  support 
all  three  missions,  while  other  activities  may  support  one  mission  and  detract 
from  another.  The  output  in  support  of  the  separate  missions  should  all  contri¬ 
bute  to  system  output,  and  still  should  remain  separable. 

(5)  The  value  of  the  performance  measure  associated  with  a  unit  of  output 
should  be  proportional  to  the  quantity  of  resources  normally  required  to 
generate  the  output.  For  example,  a  hospital  day  would  probably  be  worth  more 
than  an  outpatient  visit,  and  a  day  in  an  intensive  care  unit  should  be  worth 
more  than  a  regular  hospital  day. 

(6)  The  performance  measures  should  promote  effective  Coordination  of  Bene¬ 
fits  (COB).  This  requirement  is  difficult  since  the  amount  of  potential  COB 
recovery  is  different  in  different  areas;  however,  COB  does  reduce  total  cost  to 
the  government  and  should  be  encouraged. 

(7)  The  performance  measures  should  promote  high  quality  output.  An  out¬ 
patient  visit  which  does  not  effectively  solve  a  problem  should  not  be  as 
valuable  as  one  which  does  solve  the  probler  'n  a  similar  manner,  a  records 
system  which  provides  for  continuity  of  care  --  records  access  --  when  an  MTF 
patient  receives  care  under  CHAMPUS  should  result  in  higher  quality,  more  effec¬ 
tive  care.  One  would  hope  that  the  performance  measures  would  promote  such  a 
system  to  the  extent  it  is  worthwhile. 
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(8)  The  performance  measures  should  allow  us  to  compare  the  performance  of 
our  health  benefit  delivery  system  with  other  health  benefit  delivery  systems. 
Even  though  there  is  no  other  health  benefit  delivery  system  which  is  truly  com¬ 
parable  to  ours,  we  must  be  able  to  look  at  our  performance  and  compare  it  with 
other  systems.  We  may  be  providing  six  outpatient  visits  per  eligible  benefi¬ 
ciary  per  year  compared  with  about  four  visits  per  member  per  year  for  a  Kaiser, 
but  it  may  be  that  we  are  counting  telephone  consultations  as  visits  while 
Kaiser  is  not.  Or,  we  may  be  counting  immunizations  as  a  visit,  and  we  must 
provide  significantly  more  immunizations  per  year  because  of  line  requirements 
to  have  shot  records  always  current. 

(9)  The  performance  measures  should  be  adaptable  to  the  changing  world.  The 
CWU  has  remained  static  for  a  number  of  years  while  the  practice  of  medicine  has 
changed  significantly.  It  is  very  difficult  to  get  such  a  measurement  changed; 
therefore,  rather  than  being  a  management  tool,  the  measurement  can  become  the 
master.  The  proposed  Health  Care  Unit  (HCU)  is  tied  to  the  Uniform  Chart  of 
Accounts  (UCA)  so  that  as  the  UCA  adapts  to  account  for  the  resources,  the  out¬ 
put  measure  changes. 

SUMMARY 

We  are  faced  with  a  health  care  delivery  system  which  is  split  into  two  separate 
parts,  and  which  has  numerous  competing  missions.  With  the  current  performance 
measures,  major  portions  of  the  system  are  completely  ignored,  and  it  is  not 
possible  to  measure  our  performance  in  the  health  benefits  arena  against  the 
performance  of  other  health  benefit  delivery  systems.  The  result  is  an  inabi¬ 
lity  to  support  required  programs,  and  the  creation  of  incentives  which  increase 
government  costs.  In  this  presentation,  we  have  developed  a  set  of  objectives 
for  performance  measures  which,  if  satisfied,  will  meet  our  needs.  The  next 
seeps  in  the  process  are  to  develop  criteria  for  each  objective,  and  give 
appropriate  weights  to  each  criteria. 


HOSPITAL  CASE  MIX: 

CURRENT  RESEARCH  EFFORTS  BY  THE  NAVY  MEDICAL  DEPARTMENT 

CDR  Karen  A.  Rieder,  NC,  USN 

INTRODUCTION 

During  the  past  three  years,  the  Health  Care  Services  Research  Department  at 
the  Naval  School  of  Health  Sciences,  Bethesda  Maryland,  has  been  conducting 
ongoing  research  in  hospital  case  mix  to  answer  the  following  questions: 

1.  How  does  the  Diagnosis  Related  Groups  (DRG)  classification  scheme  com¬ 
pare  to  the  patient  grouping  strategies  normally  used  by  the  Navy? 

2.  Can  DRGs  be  used  as  an  analysis  tool  for  monitoring  hospital  performance 
and  for  allocating  resources  within  the  Navy? 

3.  Are  DRGs  a  valid  patient  grouping  method  for  the  Navy  Medical  Depart¬ 
ment,  or  are  there  additional  variables,  some  of  which  may  be  unique  to  military 
settings,  that  will  help  explain  differences  among  facilities  in  relation  to 
length  of  stay? 

The  Navy's  interest  in  DRGs  developed  as  an  attempt  to  help  explain  the  con¬ 
siderable  variation  in  length  of  stay  data  across  Navy  hospitals.  The  commonly 
used  workload  measures,  such  as  admissions,  outpatient  visits,  occupied  bed 
days,  and  the  Composite  Work  Unit  provided  minimal  insight  into  the  reason  for 
these  variations.  Therefore,  the  frequently  accepted  explanation  that  differen¬ 
ces  in  hospital  case  mix  are  a  major  cause  of  these  variations  was  examined. 
Grouping  Strategy 

To  ascertain  the  efficacy  of  DRGs  as  a  patient  grouping  strategy,  their 
ability  to  account  for  variation  in  length  of  stay  was  compared  to  four  alter¬ 
native  grouping  techniques.  These  techniques  are  common  classification  methods 
used  by  the  Navy  to  group  patients  and  include  either  (1)  major  diagnosis  class, 
(2)  disease  subcategory,  (3)  three  digit  diagnosis  code,  or  (4)  disease  sub¬ 
category  subdivided  by  surgery  and  complications. 
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To  test  DRGs  as  an  alternative  patient  grouping  method,  it  was  necessary  to 
convert  the  diagnosis  and  surgery  codes  used  by  the  Navy  to  the  codes  required 
to  assign  a  patient  to  a  DRG.  The  Research  Department  staff  completed  a  com¬ 
puter  edit  which  replaced  each  ICD-9  (International  Classification  of  Diseases, 
9th  Revision)7  diagnosis  code  and  ICPM  (International  Classification  of 
Procedures  in  Medicine)^  surgery  code  with  an  appropriate  ICD9-CM  (Clinical 
Modif ication)!  code.  This  procedure  was  not  designed  to  provide  an  exact 
mapping  between  the  two  systems  but  ensured  that  patient  records  were  assigned 
to  the  correct  DRG.  Ninety  percent  of  the  1980  inpatient  records  were  assigned 
to  a  ORG:  the  other  10%  had  ICD-9  codes  that  were  so  vague  they  could  not  be 
converted,  or  the  charts  had  incomplete  date'  The  total  sample  consisted  of 
approximately  188,000  inpatient  records.  As  an  aside,  the  1982  inpatient 
records  were  recently  converted  using  a  revised  conversion  process.  This  time, 
98%  of  the  records  were  assigned  to  a  DRG.  This  computer  bridge  has  been  shared 
with  the  Army  and  is  currently  being  validated  through  a  contract  with  the 
Commission  on  Professional  and  Hospital  Activities  (CPHA). 

Findings  indicated  that  DRGs  explained  significantly  more  of  the  total 
variation  in  length  of  stay  for  patients  at  naval  hospitals  than  the  other 
currently  used  grouping  techniques.  As  indicated  in  Table  1,  DRGs  accounted  for 
about  25%  of  the  variation  in  patient  length  of  stay,  which  was  more  than  any  of 
the  comparison  methods  tested.  Using  a  F-test,^  this  difference  was  found  to  be 
statistically  significant.  Perhaps  more  importantly,  DRGs  required  that  the 
sample  be  subdivided  into  445  groups  whereas  grouping  the  sample  by  principal 
diagnosis  code  required  905  groups.  This  smaller  number  has  definite  benefits 
for  hospital  utilization  review  since  there  are  fewer  patient  groups  to  monitor. 

However,  we  also  found  that  DRGs  were  able  to  explain  less  than  25%  of  the 
total  variation  in  contrast  to  the  New  Jersey  study  findings  which  accounted  for 
43%  of  the  variation  in  length  of  stay.^ 


Therefore,  DRGs  are  a  better  method 


PERCENTAGE  OF  VARIATION  IN  LENGTH  OF  STAY 
EXPLAINED  BY  SELECTED  PATIENT  GROUPING  METHODS 
NAVAL  HOSPITALS,  CY  1980 


Patient  Grouping  Method 

Number 

of 

Groups 

Percent  of 
vari ation 
explained 

Comparison  methods 

Major  diagnostic  class 

18 

5.1 

Disease  subcategory 

118 

15.5 

Disease  subcategory  by 

subcategory  and  complications 

451 

20.8 

Three  digit  diagnosis  code 

905 

21.4 

DRGs 

445 

24.5 

for  explaining  variation  in  LOS  data  than  the  methods  normally  used  by  the  Navy 
to  classify  patients,  but  they  explain  less  variation  than  in  the  civilian  com¬ 
munity. 

Hospital  Performance 

To  evaluate  the  use  of  DRGs  to  monitor  hospital  performance,  two  studies 
have  been  conducted.  One  used  length  of  stay  as  the  dependent  variable  while 
the  second  looked  at  convalescent  leave  days. 

The  length  of  stay  (LOS)  study  was  undertaken  to  demonstrate  how  the  DRG 
methodology  could  be  used  as  an  analysis  tool  to  investigate  very  practical 
management  and  quality  assurance  questions.  Evaluating  differences  in  length  of 
stay  is  difficult  because  length  of  treatment  can  be  dramatically  affected  by 
many  factors,  such  as  diagnosis,  disease  severity,  hospital  characteristics,  as 
well  as  administrative  policies  and  procedures,  and  individual  physician  prac¬ 
tice.  The  ability  to  identify  comparable  groups  of  patients  is  especially 


important  because  naval  hospitals  are  so  diverse  with  respect  to  location, 
teaching  status,  and  workload  (Table  2).  Therefore,  in  order  to  conduct  utilization 
review,  one  must  be  able  to  identify  and  have  access  to  length  of  stay  data  for 
specific  groups  of  patients  that  are  comparable  across  all  the  hospitals  being 
studied. 

Table  2 

LOCATION,  TEACHING  STATUS,  AND  WORKLOAD 
NAVAL  HOSPITAL,  FY  1980 


Teaching  Status  Number 

Residency  Teaching  Hospitals  4 

Family  Practice  Teaching  Hospitals  5 

Non-teaching  Hospitals  26 

Total  35 

Location  Number 

United  States  24 

Outside  U.S.,  ashore  _11 

Total  35 

Workload  Range 

Admissions  95  to  31143 

Daily  average  number  of 

patients  occupying  beds  1  to  499 

Average  length  of  stay  (days)  2  to  9 


To  do  this,  LOS  data  from  matched  hospitals  were  adjusted  to  determine  to 
what  extent  variations  could  be  accounted  for  by  patient  case  mix.  The  results 
indicated  that  case  mix  differences  explained  some,  but  not  all,  of  the  dif¬ 
ferences  in  total  LOS  between  similar  facilities.  In  fact,  for  specific  DRGs, 
average  length  of  stay  at  one  facility  was  over  twice  as  long  as  that  for  the 
comparison  hospitals. 
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These  results  could  be  interpreted  in  at  least  two  alternative  ways.  First, 
at  those  facilities  with  longer  lengths  of  stay,  administrative  policies  and 
procedures  could  be  contributing  to  the  unexplained  variance.  If  this  explana¬ 
tion  is  correct,  medical  audits  could  be  conducted  to  identify  specific  problem 
areas  where  policy  changes  could  help  realign  a  facility’s  average  length  of 
stay  data  with  its  peers.  The  second  interpretation  is  that  DRGs  do  not  ade¬ 
quately  account  for  differences  in  patient  case  mix  among  naval  hospitals.  One 
could  argue  that  patients  at  some  hospitals  require  more  medical  attention  than 
patients  at  another  facility.  For  example,  DRGs  do  not  account  for  progression 
or  stage  of  disease,  a  fact  well  discussed  in  the  recent  literature. ^  Perhaps 
the  ‘'outlier"  facility  had  a  higher  proportion  of  patients  at  a  more  serious 
stage  of  illness  who  required  in-creased  treatment  time. 

The  second  study  using  DRGs  to  monitor  hospital  performance  compared  the 
four  patient  grouping  methods  for  their  ability  to  explain  differences  in  con¬ 
valescent  leave  (CL)  among  active  duty  Navy  and  Marine  Corps  personnel.  The 
assumption  made  was  that  patients  within  a  DRG  tend  to  require  the  same  amount 
of  convalescent  leave. 

For  15,791  dispositions  during  1980,  results  indicated  that  the  greatest 
amount  of  variation  in  convalescent  leave  -  40.9%  -  was  accounted  for  by  the  DRG 
methodology  (Table  3),  and  that  10  DRGs  generated  41%  of  the  CL  days  for  active 
duty  personnel.  Although  DRGs  accounted  for  a  slightly  higher  percentage  of  the 
variance  than  three  digit  diagnosis  codes  (40.9%  versus  40.1%),  this  difference 
was  not  statistically  significant.  However,  DRGs  were  able  to  explain  this 
slightly  higher  variance  by  dividing  the  population  into  fewer  groups  (350 
groups  for  DRGs  versus  634  groups  for  three  digit  diagnosis  codes).  This  lower 
number  of  groups  is  desirable  because  it  is  much  easier  for  a  manager  to  monitor 
fewer  patient  groupings. 
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Table  3 

PERCENTAGE  OF  VARIATION  IN  RECOMMENDED  CONVALESCENT  LEAVE  DAYS 
ACCOUNTED  FOR  BY  SELECTED  PATIENT  GROUPING  METHODS  FOR  ACTIVE  DUTY  NAVY 
AND  MARINE  CORPS  PERSONNEL,  CY  1980 


Patient  Grouping  Method 

Number 

of 

Groups 

Percent  of 
vari at  ion 
explained 

Major  diagnostic  class 

22 

20.9 

Disease  subcategory 

112 

28.4 

Disease  subcategory  by 

subcategory  and  complications 

376 

36.4 

Three  digit  diagnosis  code 

634 

40.1 

DRGs 

350 

40.9 

The  remainder  of  the  study  estimated  the  potential  savings  in  lost  work  time 
that  could  result  by  reducing  CL  days  for  those  DRGs  that  exceed  the  average 
amount  recommended  by  all  naval  facilities.  This  was  calculated  from  a  formula 
that  delineated  the  number  of  dispositions  within  the  selected  DRG  at  a  specific 
hospital  and  the  average  number  of  convalescent  leave  days  granted  by  all  faci¬ 
lities  for  that  ORG. 

In  order  to  develop  a  patient  grouping  method  based  on  DRGs  that  would  opti¬ 
mally  account  for  differences  in  convalescent  leave  among  active  duty  Navy  and 
Marine  Corps  personnel,  one  should  consider  at  least  two  further  steps.  The 
first  strategy  is  to  recategorize  variables  used  by  the  DRGs  developers  into 
groups  that  are  more  appropriate  to  the  active  duty  population  in  terms  of  age. 
Secondly,  certain  DRGs  may  also  be  combined,  thus  reducing  the  total  number  of 
groups  needed  to  account  for  differences  in  convalescent  leave.  For  example, 
DRGs  159162  are  used  to  group  patients  over  age  17  who  required  surgical  repair 


of  a  hernia.  For  these  four  DRGs  the  mean  convalescent  leave  days  ranged  from 
15.7  to  17.3.  Since  there  is  very  little  difference  in  convalescent  leave  among 
these  DRGs,  they  could  be  collapsed  into  one  group. 

The  results  of  these  two  studies  demonstrate  that  DRGs  can  be  used  to  moni¬ 
tor  not  only  length  of  stay  but  that  patient  groupings  based  on  DRGs  may  also  be 
homogeneous  with  respect  to  other  variables,  such  as  convalescent  leave. 

The  next  question  becomes  how  can  this  system  impact  on  resource  allocation? 
Allocate  Resources 

Though  a  case  mix  index  based  on  length  of  stay  is  appropriate  for  use  in 
utilization  review,  one  based  on  relative  costs  per  patient  case  is  desirable  to 
make  resource  allocation  decisions.  Yet,  at  the  present  time,  Navy  medical 
accounting  systems  do  not  allow  for  detailed  cost  determinations.  Hospital  and 
clinic  reimbursement  are  based  on  the  Composite  Work  Unit  (CWU)  which  does  not 
account  for  differences  in  patients  treated  i.e.,  case  mix.  Our  FY  85  research 
efforts  will  be  directed  towards  the  application  of  case  mix  methodology  to  the 
allocation  of  resources  to  Navy  medical  facilities.  During  this  study  we  will 
identify,  compare,  and  evaluate  the  implications  of  allocating  resources  using 
various  techniques  that  account  for  patient  case  mix.  Resource  allocation 
systems  developea  by  civilian  researchers  and  the  Veterans  Administration  will 
be  studied  for  their  possible  application  to  the  Navy  Health  Care  System. 

Uni que  Mi  1 i tary  Variables 

As  a  result  of  our  studies  we  have  found  that  DRGs,  although  better  than 
other  currently  used  grouping  methods,  explained  less  variation  in  patient 
length  of  stay  for  naval  hospitals  than  that  reported  by  civilian  researchers. 
This  is  partly  due  to  the  fact  that  current  Navy  diagnosis  and  surgery  codes  are 
less  precise  than  the  codes  used  by  the  DRGs.  But  to  evaluate  whether  the 
current  DRGs  are  a  valid  grouping  method  for  the  Navy,  we  have  completed  two 


further  studies. 
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First,  we  explored  whether  other  factors  already  coded  on  the  Navy  discharge 
abstract  data  base  significantly  contributed  to  length  of  stay  variance  in  naval 
nospitals.  Second,  we  investigated  differences  in  severity  of  illness  within  a 
DR6  as  one  possible  explanatory  variable  using  a  nursing  patient  classif ication 
system,  which  provided  an  estimate  of  the  daily  nursing  care  hours  each  patient 
requires.  If  one  accepts  the  assumption  that  more  severely  ill  patients  require 
increased  nursing  care,  patient  classification  may  serve  as  an  estimate  of 
patient  disease  severity  within  DRGs  and  is  data  already  being  collected  by  all 
of  our  hospitals  for  estimating  staffing  levels.  The  results  of  both  of  these 
studies  will  be  presented  at  a  future  session. 

In  conclusion,  the  studies  conducted  by  the  Research  Department  have 
demonstrated  that  DRGs  are  a  preferred  grouping  strategy  which  could  assist 
military  program  managers  to  monitor  hospital  performance  and  to  allocate 
resources.  The  findings  also  suggests  that  some  modifications  of  the  DRG 
grouping  could  be  necessary  to  increase  its  applicability  for  the  military 
setting. 


» 
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THE  AMEDD  PERFORMANCE  MEASUREMENT  STUDY 
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BACKGROUND  AND  PURPOSE 

In  1982  the  US  Army  Health  Services  Command  (HSC)  Productivity  Study  staff 
surveyed  a  cross-section  of  Army  Medical  Department  (AMEDD)  health  care  admin¬ 
istrators  and  providers  to  assess  satisfaction  with  the  extent  to  which  the 
traditional  measure  of  patient  care  workload,  the  Medical  Care  Composite  Unit 
(MCCU),  represented  accurately  the  major  patient  care  output  of  Army  Medical 
Treatment  Facilities  (MTF).  It  was  determined  that  the  MCCU  was  no  longer  ade¬ 
quate  for  use  in  measures  of  system  performance  needed  for  resource  justifica¬ 
tion  and  resource  allocation,  nor  individual  performance  including  productivity 
and  quality  assurance.  It  was  apparent  that  the  MCCU  did  not  give  appropriate 
credit  for  the  increased  sophistication  of  ambulatory  medicine,  the  added 
requirements  of  quality  assurance  activities,  nor  many  non-patient  care  func¬ 
tions  to  include  readiness  and  training.  Therefore,  the  Army  Surgeon  General 
directed  HSC  to  conduct  a  study  with  the  following  purpose: 

To  evaluate  current  measures  of  AMEDD  health  care 
system  performance  and,  as  required,  develop  better 
measures  and  workload  data  capture  systems  which 
accurately  reflect  actual  resource  utilization. 


SYSTEMS  APPROACH  TO  PRODUCTIVITY  MEASUREMENT 
To  develop  a  system  of  performance  measurement,  it  was  necessary  to  devise  a 
framework  within  which  to  consider  productivity  and  performance  measurement.  Of 
course,  one  philosophy  of  productivity  improvement  is  that  given  in  Figure  1. 
However,  in  our  approach  we  consider  productivity  to  be  a  systemic  concept  con¬ 
cerning  the  conversion  of  inputs  to  outputs  by  the  system  under  consideration. 
This  concept  can  best  be  applied  to  the  hospital  setting  utilizing  a  systems 
approach  as  depicted  in  Figure  2.  The  "raw"  inputs  to  the  hospital,  labor, 


as 


Figure  1 
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materials  and  services,  and  captial  can  now  be  identified  consistently  across 
all  military  hospitals  and  allocated  to  various  workcenters,  departments  or 
"producing  units"  under  the  Uniform  Chart  of  Accounts  (UCA).  The  work  centers 
or  departments  produce  what  might  be  called  "intermediate  outputs"  such  as 
laboratory  tests,  radiology  studies,  prescriptions,  nursing  manhours  of  care, 
and  patient  days.  Notice  that  patient  days  are  considered  only  intermediate 
products  in  this  model,  a  major  difference  from  the  MCCU  approach.  All  of  these 
intermediate  products  are  ordered  and  combined  by  the  professional  component  of 
care  into  final  products.  Ideally,  these  final  products  would  be  such  esoteric 
outputs  as  change  in  health  status,  medical  and  patient  or  community  education, 
or  medical  readiness.  However,  in  the  absence  of  satisfactory  measures  for  such 
output,  we  usually  choose  some  measure  of  care  provided  or  services  delivered  as 
the  final  product.  As  indicated  earlier,  the  MCCU,  which  has  been  the  tradi¬ 
tional  measure  of  fine'  oroduct,  is  not  a  very  good  one.  This  idea  will  be 
expanded  in  the  next  section. 

By  separating  the  process  of  health  care  delivery  into  the  above  components, 


two  very  important  ideas  can  be  introduced.  The  performance  of  the  producing 
units  may  be  measured  by  looking  at  what  might  be  called  "efficiency  ratios" 
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such  as  lab  tests  per  manhour,  nursing  hours  per  bed  day,  or  cost  per  prescrip¬ 
tion.  The  performance  of  the  professional  combining  mechanism  may  be  evaluated 
by  determining  "efficacy  ratios"  which  compare  how  effectively  and  appropriately 
the  intermediate  products  have  been  ordered  and  utilized  to  produce  the  final 
product.  As  will  be  seen  later,  it  is  very  important  to  distinguish  between 
these  two  related  but  different  components  of  productivity.  It  may  be  that  each 
of  the  support  departments  in  a  given  hospital  may  be  the  most  efficient  in, 
say,  a  region.  That  is,  the  laboratory  may  have  the  lowest  cost  per  weighted 
procedure,  the  radiology  department  may  have  the  greatest  output  per  manhour, 
etc.  But,  the  lab  tests  and  radiology  films  may  have  been  ordered  in  far  too 
great  a  quantity  for  the  severity  level  and  case  mix  of  the  final  patient  pro¬ 
duct  of  the  hospital.  This  may  be  true  conceptually,  but  it  will  continue  to  be 
impossible  to  detect  until  a  measure  of  hospital  output  is  developed  that  is 
sensitive  to  real  differences  in  case  mix. 

NEED  FOR  IMPROVED  MEASURES  OF  OUTPUT 

The  MCCU  has  been  utilized  as  a  measure  of  hospital  output  since  about  1957. 

During  this  period  it  has  been  virtually  unchanged,  although  the  practice  of 

medicine  has  changed  dramatically.  The  MCCU,  or  Composite  Work  Unit  (CWU)  as  it 

is  known  in  the  other  services,  is  defined  as  follows: 

MCCU  =  (10  x  ADM)  +  (10  x  LB)  +  OBD  +  (0.3  x  OPV ) 

where,  ADM  =  Number  of  Admissions 

LB  =  Number  of  Live  Births 
OBD  =  Number  of  Occupied  Bed  Days 
OPV  =  Number  of  Outpatient  Visits. 

Several  problems  with  the  MCCU  are  well  known.  First  is  the  fact  that  a 
major  component  of  the  MCCU  is  clearly  an  intermediate  product  rather  than  a 
final  product.  Figure  3  clearly  shows  the  impact  of  this  shortcoming.  In  the 
sense  of  performance  assessment,  clinic  visits  and  inpatient  dispositions  can  be 
considered  as  final  products  which  contribute  to  our  mission-providing  patient 
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Figure  3 

MCCU  COMPONENT  TRENDS 
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ADCV  -  AVERAGE  DAILY  CLINIC  VISITS 
ADD  -  AVERAGE  DAILY  DISPOSITIONS 
ADPL  -  AVERAGE  DAILY  PATIENT  LOAD 
ALOS  -  AVERAGE  LENGTH  OF  STAY 


SOURCE:  IPOS  Worldwide  Database.  All  patients  including  transfers. 


care.  However,  patient  days  are  intermediate  products  which  are  not  an  "end" 
but  a  "means  to  an  end,"  the  end  being  treated  patients.  Changing  patterns  of 
utilization,  medical  practice,  and  technological  advances  have  very  signifi¬ 
cantly  reduced  the  average  length  of  patient  stay  and,  therefore,  average  daily 
patient  load  or  beds  occupied.  This  has  resulted  in  a  leveling  off  of  the  MCCU. 
The  reduced  patient  stays  are  clearly  shorter  at  the  end  of  the  stay  rather  than 
the  beginning.  This  means  fewer  less  costly  "hotel  services"  are  being  provided 
but  the  intensity  of  resources  applied  to  the  actual  acute  days  of  care  has 
increased.  Certainly  there  is  no  less  involvement  of  physicians,  support  staff 
assets,  and  other  resources  that  are  more  costly  than  those  associated  with 
hotel  services  in  producing  the  complete  patient  stay.  A  preferred  measure  of 
output  would  be  based  on  the  "case"  and  the  "episode"  rather  than  the  patient 
day  to  indicate  greater  productivity  when  fewer  days  are  provided  to  treat  the 
same  or  increased  numbers  of  patients  with  acceptable  quality  of  care. 

A  second  major  problem  with  the  MCCU  is  the  relative  value  of  inpatient  ver¬ 
sus  outpatient  care  given  by  the  weights  on  admissions,  bed  days,  and  clinic 
visits.  It  is  easy  to  see  from  the  formula  that,  given  the  average  length  of 
stay  for  Army  patients  worldwide  is  about  six  days,  the  typical  admission 
generates  16  MCCUs.  Since  the  weight  on  a  clinic  visit  is  only  0.3,  the  equiva¬ 
lent  "substitution  rate"  is  53:1.  That  is,  a  physician  would  have  to  see  53 
patients  in  the  ambulatory  setting  to  generate  the  equivalent  workload  in  MCCUs 
of  a  single  inpatient  stay.  Thus,  when  resources  are  allocated  on  the  basis  of 
the  number  of  MCCUs  generated,  even  when  good  medical  judgement  would  dictate 
treating,  say,  a  vasectomy  as  an  outpatient  case,  the  "logical"  course  of  action 
is  to  admit.  Similarly,  when  an  "extra"  day  in  the  hospital  results  in  one  more 
MCCU  with  its  issociated  resources,  the  t endancy  may  be  to  "keep  the  patient  a 
little  longer"  than  is  really  medically  necessary.  Clearly,  the  last  few  days 
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of  care  are  not  very  expensive  for  the  hospital  in  most  cases.  This  is  an 
excellent  example  of  how  the  measurement  tool  prescribes  practice. 

Figure  4  depicts  a  third  major  problem  with  the  MCCU,  its  inability  to 


Figure  4 

WORKLOAD  MEASUREMENT 


INPATIENT  A:  45  year  old  male  presents  with  chief  complaint  of 
crushing  substernal  chest  pain.  Admitted  to  CCU,  Diagnosis: 
AMI.  Third  day  develops  cardiogenic  shock,  undergoes  emergency 
cardiac  catheterization  and  open  heart  surgery  (CABO).  Dies  on 
sixth  hospital  day. 

Est  Cost  $50,000  MCCU  =  16 

INPATIENT  B:  45  year  old  male  presents  with  chief  complaint  of 
crushing  substernal  chest  pain.  Admitted  to  CCU.  Evaluation 
reveals  no  evidence  of  cardiac  diagnosis  but  reflux  esophagitis 
on  barium  swallow.  Transferred  to  ward  on  third  day.  Treated 
with  antacids  and  discharged  sixth  hospital  day. 

Est  Cost  $5,000  MCCU  =  16 

INPATIENT  C:  16  year  old  active  duty  male  presents  with  sore 
throat  and  fever.  Unable  to  eat  and  care  for  himself  in  bar¬ 
racks.  Unit  has  gone  to  field.  Admitted  and  treated  with 
rest,  fluids,  and  aspirin.  Discharged  on  sixth  hospital  day. 

Est  Cost  $1,500  MCCU  =  16 

OUTPATIENT  A:  30  year  old  female  presents  with  2  cm  lump  in 
right  breast  for  outpatient  breast  biopsy.  Diagnosis:  non- 
malignant  lesion. 

Est  Cost  $800  MCCU  =  0.3 

OUTPATIENT  B:  79  year  old  male  with  weakness  in  left  leg  of 
questionable  duration.  CAT  scan  reveals  old  infarct  with  no 
new  lesion.  No  treatment  given. 

Est  Cost  $400  MCCU  =  0.3 

OUTPATIENT  C:  19  year  old  active  duty  male  returns  three  days 
post  emergency  room  visit  for  scalp  laceration.  Sutures  not 
yet  ready  to  be  removed.  Return  in  two  days. 


Est  Cost  $25 


MCCU  =  0.3 
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distinguish  between  different  types  of  inpatients  or  different  types  of  out¬ 
patients.  This  figure  contains  descriptions  of  three  inpatients  and  three  out¬ 
patients,  a  very  rough  estimate  of  the  equivalent  civilian  cost  of  care,  and  the 
number  of  MCCUs  the  patient  would  generate.  These  examples  again  demonstrate 
how  the  MCCU  provides  no  incentives  to  treat  patients  on  an  outpatient  basis  nor 
to  reduce  the  length  of  stay  for  less  severe  cases.  The  measure,  in  essence, 
has  dictated  how  the  care  is  delivered. 

In  an  attempt  to  satisfy  some  of  the  criticisms  of  the  MCCU,  and  at  the  same 
time  develop  an  output  measure  based  on  data  available  under  UCA,  the  Office  of 
the  Assistant  Secretary  of  Defense  for  Health  Affairs  (OASDCHA])  contracted 
Vector  Research,  Inc.,  of  Ann  Arbor,  Michigan,  to  develop  the  Health  Care  Unit 
(HCU).  Using  FY  82  UCA  cost  and  workload  data  in  regression  analysis,  the  stan¬ 
dardized  weights  depicted  in  Figure  5  for  the  two-digit  UCA  accounts  were  devel¬ 
oped. 

The  HCU  is  certainly  a  move  in  the  right  direction.  It  separates  Medical 
inpatients  from  Surgical  inpatients  and  Family  Practice  clinic  visits  from 
Orthopedic  clinic  visits.  Yet,  each  Medical  admission  is  given  the  same  amount 
of  workload  credit  except  for  length  of  stay  and  Outpatients  B  and  C  in  Figure  4 
would  still  record  the  same  amount  of  resource  use  in  terms  of  HCUs.  It  is 
likely  that  the  subdivision  at  the  two-digit  UCA  level  is  not  sufficiently  fine 
to  determine  real  case  mix  differences.  Research  needs  to  be  done,  however,  to 
determine  if  extending  the  HCU  concept  to  the  third  or  fourth  digit  level  might 
be  sufficient  for  output  measurement.  In  addition,  the  current  HCU  weights  only 
decrease  the  equivalent  substitution  rate  of  outpatient  care  for  inpatient  care 
from  53:1  to  about  40:1  on  the  average. 

Of  course,  as  is  being  thoroughly  discussed  in  other  areas  of  this  con¬ 
ference,  Diagnosis  Related  Groups  (DRGs)  are  being  considered  as  a  basis  for  the 
management,  assessment,  and  measurement  of  inpatient  workload.  If  we  can  couple 


REFINED  HCU  WEIGHTS 


HCU 

Clinical  Area 

Disposition 

Weight 

OBD 

Weight 

Visit 

Weight 

Dental  Proc 
Weight 

Medical 

(AA) 

.097 

.137 

Surgical 

(AB) 

.319 

.154 

OB/GYN 

(AC) 

.216 

.148 

Pediatric 

(AD) 

.121 

.120 

Orthopedic 

(AE) 

.604 

.078 

Psychiatric 

(AF) 

.330 

.107 

Medical 

(BA) 

.022 

Surgical 

(BB) 

.028 

OB/GYN 

(80 

.021 

Pediatric 

(8D) 

.017 

Orthopedic 

(BE) 

.028 

Psychiatric 

(BF) 

.026 

Fami ly  Practice 

(BG) 

.021 

Primary  Care 

(BH) 

.021 

Emergency 

(BI) 

.027 

Flight 

(BJ) 

.030 

Underseas 

(BK) 

.015 

Dental  Service 

(CA) 

.005 

Dental  Lab 

(CB/CC) 

.002 

this  with  the  corresponding  concept  in  the  outpatient  area,  i.e.,  Ambulatory 
Visit  Groups  (AVGs),  we  would  have  the  basis  for  a  weighted  measure  of  patient 
care  output.  The  weights  as  signed  to  the  various  categories  of  inpatient  and 
outpatient  care  would  be  based  on  the  relative  resource  requirements  of  typical 
cases  in  each  category.  For  research  purposes,  the  logical  place  to  sta-t  is, 
similar  to  what  the  Veteran's  Administration  (VA)  has  done,  to  use  the  Medicare 
weights  published  by  the  Health  Care  Financing  Administration  (HCFA) .  Other 
weights  are  available  from  the  VA  and  the  State  of  New  Jersey  but  the  services 
should  also  investigate  developing  a  unique  set  of  weights  for  military  medical 
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EFFICIENCY  VERSUS  EFFICACY 

As  mentioned  previously,  the  systems  approach  to  performance  or  productivity 
measurement  described  in  Figure  2,  coupled  with  an  output  measure  based  on,  for 
example,  weighted  DRGs  and  AVGs,  which  is  sensitive  to  the  expected  resource 
requirements  of  the  actual  case  mix  of  the  patient  population,  would  allow  the 
analysis  of  the  efficiency  versus  efficacy  of  resource  utilization.  The  fun¬ 
damental  relationship  which  relates  these  two  components  to  some  total  measure 
of  hospital  productivity  may  be  represented  as  follows. 

WORK  UNITS  x  TOTAL  OUTPUT  =  TOTAL  OUTPUT 
TOTAL  INPUT  WORK  UNITS  TOTAL  INPUT 

The  first  ratio  above  is  an  "efficiency  ratio"  which  is  a  measure  of  how  much 
intermediate  product  of  a  work  center  is  generated  per  unit  of  total  input 
measured,  usually,  in  dollars.  The  second  ratio  is  really  the  reciprocal  of  a 
productivity-type  ratio  and  is  here  called  an  "efficacy  ratio".  This  ratio 
relates  the  work  units  produced  by  the  work  centers  to  the  total  output  of  the 
hospital  and,  thus,  measures  how  well  the  intermediate  products,  e.g.,  lab  tests 
and  x-rays,  have  been  utilized  in  producing  the  case  mix  treated  by  the  hospi¬ 
tal.  The  final  ratio  is  the  common  expression  of  a  "total  productivity  ratio" 
in  that  it  relates  total  output  to  total  input.  The  meaningfulness  of  these 
ratios  is  totally  dependent  upon  the  appropriateness  of  the  measure  of  total 
output.  Again,  historically  this  has  been  the  MCCU. 

In  Figure  6  an  example  is  provided  using  current  UCA  data  to  illustrate 
these  concepts.  In  this  example,  the  HCU  is  used  as  the  measure  of  final  pro¬ 
duct  since  it  dees  begin  to  distinguish  case  mix  although  not  nearly  to  the 
deqree  necessary  for  such  analysis.  This  data  is  taken  from  actual  UCA  reports 
of  three  similarly  sized  Army  MTF.  The  work  units  (intermediate  products)  in 
each  of  the  ancillary  areas  (work  centers)  are  standard  weighted  units.  "Cost 
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per  work  unit,"  Column  (3)  consists  of  "efficiency"  ratios  for  the  ancillary 
support  departments  of  the  three  facilities.  Notice  the  wide  range  of  values. 

It  would  appear,  on  the  surface,  that  Hospital  A  is  much  more  efficient  than 
Hospital  C.  Indeed,  since  these  values  are  the  only  ones  that  appear  on  an 
actual  UCA  report.  Hospital  C  might  be  asked  to  explain  these  figures.  For  each 
of  the  MTF,  the  HCU  was  calculated  and  the  weighted  work  units  per  HCU  displayed 
in  Column  (4)  as  our  "efficacy"  measure.  Notice  again  the  wide  variation  in 
facilities  with  similar  size  and,  it  turns  out,  mission.  If  the  measure  of 
final  output,  the  HCU  in  this  case,  really  did  adjust  for  differences  in  case 
mix  of  the  patient  workload,  then  these  differences  in  efficacy  could  be  exam¬ 
ined  to  determine  if  policies  surrounding  professional  medical  care  delivery 
were  responsible.  As  was  indicated  in  Dr.  Horn's  presentation,  physicians  could 
be  educated  as  to  differences  in  their  practice  patterns.  The  final  column  of 
the  display  indicates  the  overall  impact  of  these  differences.  These  "ancillary 
cost  per  HCU"  ratios  in  Column  (5)  measure  the  overall  "total  productivity"  of 
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tre  ancillary  departments  and  are  the  product  of  Column  (3)  times  Column  (4)  as 
was  suggested  in  the  fundamental  relationship  given  earlier.  It  can  be  seen, 
for  example,  that  even  though  the  Radiology  Service  at  Hospital  C  appeared  to  be 
the  most  inefficient,  the  fact  that  the  physicians  there  are  using  less  costly 
radiology  studies  even  after  adjusting  for  case  mix  (as  much  as  the  HCU  will 
allow),  results  in  the  greatest  overall  productivity  in  treating  final  products, 
i.e.,  lower  ancillary  cost  per  HCU. 

[n  summary,  the  ability  to  separate  the  performance  of  individuals,  clinics, 
hospitals,  regions,  or  commands  into  components  as  described  above,  is  very 
desirable  for  management  purposes.  It  would  allow  the  monitoring  of  produc¬ 
tivity  f  rom  the  individual  to  the  command  level  while  targeting  the  appropriate 
deficiency  when  performance  is  not  as  desired.  It  would  also  allow  the  setting 
of  standards  of  performance  for  efficiency,  efficacy,  and  overall  productivity 
in  a  way  that  could  be  related  to  the  actual  case  mix  of  the  hospitals'  patient 
populations.  Finally,  it  would  prevent  the  organizational  dysfunction  asso¬ 
ciated  with  having  to  "explain  to  higher  headquarters"  seemingly  poor  perfor¬ 
mance  which  may  be  a  function  of  the  measures  utilized  for  evaluation  rather 
than  the  efficiency  or  efficacy  of  actual  performance. 

ORGANIZATION  OF  STUDY  EFFORT 

US  Army  Health  Care  Studies  and  Clinical  Investigation  Activity  (HCSCIA), 
under  the  guidance  of  the  Chief  of  Staff,  HSC,  has  organized  the  Performance 
Measurement  Study  into  four  major  categories  as  shown  in  Figure  7.  Although  the 
details  of  work  being  done  under  each  of  these  headings  are  provided  elsewhere 
in  this  article  and  these  proceedings,  major  activities  are  summarized  here  tor 
conven i ence . 

INPATIENT  CARE:  Adapting  Medicare  Diagnosis  Related  Groups  to  AMEUO  perfor¬ 
mance  monitoring.  Work  to  this  point  has  involved  solving  problems  associated 
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Figure  7 


AMEDD  PERFORMANCE  MEASUREMENT  STUDY  (PMS) 
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INTEGRATION 
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with  converting  IPOS  ICO-9  data  to  the  Medicare  DRGs  which  are  defined  in  terms 
of  1CD-9-CM  coded  patient  records.  Coordination  is  being  made  with  Air  Force 
and  Navy  researchers  to  develop  best  possible  and  consistent  DRG  assignments. 
Activities  are  being  coordinated  with  the  Patient  Administration  Systems  and 
Biostatistical  Activity  to  facilitate  introducing  DRG  based  analysis  potential 
to  MTF  commanders. 

AMBULATORY  CARE:  Establishing  an  Ambulatory  Care  Data  Base  (ACDB)  to  cap¬ 
ture  previously  non-existant  diagnostic  and  resource  use  data  in  the  ambulatory 
setting.  Currently  mark  sense  forms  are  being  developed  and  hardware  leased  to 
expand  the  Redstone  ACDB  project  to  five  additional  sites.  A  major  problem  was 
solved  when  the  Assistant  Secretary  of  the  Army  (Financial  Management)  approved 
a  sole  source  contract  effort  to  lease  equipment,  develop  software,  and  design 
forms  in  support  of  the  ACDB.  The  ACDB  will  reside  in  the  Fort  Detrick  Data 
Processing  Center. 


NON-PATIENT  CARE:  Including  the  impact  of  training,  readiness,  mobiliza¬ 
tion,  arid  quality  assurance  efforts  on  the  ability  of  providers  and  the  MTFs  to 
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deliver  direct  patient  care.  Data  capture  methods  which  are  compatible  with  the 
ACDB  and  Uniform  Staffing  Methodologies  (USM)  data  collection  efforts  are 
currently  being  explored. 

SYSTEMS  INTEGRATION:  Relating  PMS  efforts  to  other  ongoing  studies  and  pro¬ 
jects  such  as  UCA,  USM,  and  Army  Performance  Oriented  Reviews  and  Standards 
(APORS).  A  Performance  Measurement  Data  Base  is  being  integrated  at  the  Fort 
Detrick  Data  Processing  Center  to  support  all  elements  of  the  project.  Current 
efforts  are  directed  toward  determining  the  relationship  between  UCA  costs  at 
the  workcenter  level  and  DRG  based  case  mix  indices. 

TRI-SERVICE  COOPERATION 

Realizing  that  a  unilateral  effort  by  the  AMEDD  to  replace  traditional  work¬ 
load  measures  in  budget  defense  at  DOD  and  before  the  Congress  might  be  counter 
productive,  The  Surgeon  General  directed  the  PMS  study  group  to  attempt  a  Tri- 
Service  effort.  The  first  step  in  this  attempt  is  this  Tri-Service  Performance 
Measurement  Conference  with  representatives  from  the  Army,  Navy,  Air  Force, 
Veterans  Administration,  and  the  Office  of  the  Assistant  Secretary  of  Defense 
(Health  Affairs).  The  final  AMEDD  PMS  staff  will  be  assembled  by  31  October 
1984  and  the  project  should  produce  significant  interim  results  by  30  September 
1985.  Continued  coordination  with  the  Navy  and  Air  Force  efforts  will  be 
accomplished  throughout  the  study,  as  will  contacts  with  OASD(HA)  and  OCHAMPUS. 


AMBULATORY  CARE  DATA  BASE 

LTC  Terry  R.  Misener,  ANC ,  USA 

INTRODUCTION 

Purpose 

Recognizing  the  need  for  an  Ambulatory  Care  Database  (ACDB),  1  he  Army 
Surgeon  General  tasked  the  Health  Care  Studies  Division  (HCSD),  Health  Care 
Studies  &  Clinical  Investigation  Activity  (HCSCIA),  to  examine  the  feasibility 
of  implementing  such  a  study.  The  study  proposed  to  answer  two  questions:  (1) 
Is  it  possible  to  capture  the  necessary  information  for  an  ambulatory  database? 
(i.e.,  will  health  care  providers  complete  encounter  forms  in  addition  to 
entries  they  are  required  to  make  in  the  outpatient  medical  record).  (2)  What 
types  of  reports  can  be  generated  from  the  data  gathered? 

Background 

Although  reports  to  document  Army  outpatient  workload  are  generated  on  a 
recurring  basis  the  reliability  of  the  data  and  their  usefulness  has  been 
questioned.  The  outpatient's  individual  health  record  contains  routine 
information  expected  in  any  outpatient  treatment  setting.  However,  obtaining 
aggregate  data,  auditing  a  random  set  of  outpatient  records,  documenting 
individual  health  care  providers'  practice  profiles,  or  carrying  out 
epidemiological  research,  has  not  been  possible.  A  literature  search  was 
conducted  to  include  a  review  of  the  development  of  standardized  diagnostic 
codes,  data  systems,  methods  of  data  collection,  and  medical  information 
management . 

LIMITATIONS 

Resource  constraints  included  both  time  and  personnel.  The  data  coll ec t i or 
phase  of  the  study  was  to  be  completed  by  the  enu  of  the  3U  QTK  Fi  83.  No  full 
time  employees  could  be  added  for  the  study,  i.e.,  required  personnel  were 


within  the  HCSD,  the  MEDDAC  where  the  study  was  to  be  carried  out,  and  from 
shared  data  processing  staff. 

Prior  studies  demonstrated  that  the  data  gathering  tool  needed  to  be  pro¬ 
vider  centered.  Any  table  look-ups  required  by  the  provider  should  be  kept  to  a 
minimum,  and  providers  must  feel  the  project  to  be  symbiotic,  i.e.,  they  would 
gain  something  in  return  for  their  efforts.  To  be  most  effective,  the  data 

encounter  form  was  not  to  exceed  one  page  (8  £  x  11). 

METHODOLOGY 

Over v i ew 

A  six  month  project  was  undertaken  to  collect  outpatient  encounter  infor¬ 
mation  (including  demographic  data,  workload,  and  diagnoses)  at  Fox  Army 
Hospital,  Redstone  Arsenal,  Huntsville,  Alabama  (1  Nov  82  -  31  Mar  83).  The 
10,000  -  13,000  patients  seen  each  month,  the  clerical  staff,  and  primary  care 
providers  all  assisted  in  completing  a  "mark  sense"  data  capture  form. 

The  hardware  selected  was  the  National  Computer  Systems  (NCS)  Sentry  7001 
Table-Top  Optical  Mark  Sense  Reader,  with  tape  drive  and  transport  printer 
attached.  This  equipment  was  compatible  with  hardware  existing  within  HSC .  NCS 
forms  with  an  individual  lithocode  printed  on  each  form  facilitated  merging  or 
finding  records  easily.  NCS  was  the  only  v'-'dor  known  to  provide  this  feature. 

A  two-sided,  single  page,  multicolor  (purple  and  red)  encounter  form  was 
designed  (see  Appendix  1).  The  face  validity  of  the  form  was  assured  by  the 
investigators  after  consultation  with  other  health  care  providers,  public  health 
professionals,  and  providers  at  Redstone. 

Overall  Army  needs  mandated  that  diagnostic  information  be  a  priority  ele¬ 
ment  in  the  database.  The  International  Classification  of  Health  Problems  in 
Primary  Care  (ICHPPC-2)  was  selected  (truncations  of  the  ICD-9  classification. 
The  codes  were  simple  to  use,  and  had  previously  been  utilized  in  the  Army 


Family  Practice  Database.  The  encounter  form  allowed  space  for  only  250  of  the 
possible  371  diagnoses.  The  remaining  diagnoses  not  on  the  menu  could  be  found 
in  a  preprinted  index  and  then  entered  in  spaces  provided.  Along  with  the 
demographics,  the  diagnostic  information  provides  the  core  of  the  epidemiologi¬ 
cal  data.  These  same  data  gave  the  MEDDAC  the  ability  to  carry  out  peer  review 
and  retrospective  chart  audits  in  a  reliable  and  objective  manner. 

Procedures 

After  a  one  day  pilot  test  of  the  instrument  at  Fort  Hood,  Texas,  minor  form 
design  and  instruction  sheet  changes  were  made.  A  major  change,  suggested  and 
incorporated,  was  to  request  able  patients  to  complete  their  portion  of  the 
form.  Staff  training  at  Redstone  began  two  weeks  prior  to  the  collection  of 
data. 

Prior  to  the  implementation  of  the  study,  code  numbers  were  assigned  to  each 
care  provider  and  each  clinic.  Separate  instruction  sheets  were  prepared  for 
patients,  clerical  staff,  and  care  providers. 

Patients  were  instructed  to  complete  their  portion  of  the  demographic-type 
data  which  was  checked  for  completeness  and  accuracy  by  the  clerical  staff  wh- 
entered  the  clinic  identifier,  family  member  prefix  (to  identify  household  sta¬ 
tus  of  the  patient),  appointment  status,  and  time  in.  The  remainder  of  the  form 
was  completed  by  the  providers.  The  clerical  staff  monitored  completeness  and 
entered  the  time  out  of  the  clinic.  The  provider  had  to  select  one  of  371 
diagnostic  codes  as  the  primary  reason  for  seeing  a  patient  on  a  particular 
visit.  Additionally,  the  providers  were  allowed  to  select  up  to  five  secondary 
diagnoses  germane  to  a  particular  visit  (a  secondary  diagnosis  was  not 
required ) . 

The  patient  portion  of  the  form  required  about  two  minutes  to  complete;  the 
provider  data  required  about  30  seconds  (after  providers  became  familiar  with 
frequently  used  diagnoses).  Clerical  staff  required  about  30  seconds  to  check 
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and  complete  each  form.  After  the  encounter  forms  were  completed  and  checked 
for  errors  in  the  clinic,  they  were  taken  to  a  central  point  in  the 
Administrative  Department  of  the  MEDDAC,  where  one  of  three  trained  persons  pro¬ 
cessed  the  records.  Error-free  and  corrected  forms  were  read  by  the  scanner  and 
output  onto  seven  inch  magnetic  tape.  Tapes  were  transferred  to  Fort  Sam 
Houston,  Texas,  for  processing,  analysis,  and  report  generation. 

FINDINGS 

The  major  study  question  was:  will  providers  complete  the  encounter  forms 
as  requested?  With  approximately  60,000  records  in  the  database,  it  has  been 
demonstrated  that  personnel  will  complete  the  encounter  forms.  All  primary  pro¬ 
vider  visits  included  in  the  Medical  Summary  302  Feeder  Reports  were  counted  in 
the  study.  Visits  to  physicians  accounted  for  53%  of  the  total  encounters;  47% 
were  credited  to  other  providers  (Figure  1). 
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The  second  study  question  was:  what  reports  can  be  generated  from  the 
acquired  data?  Reports  can  be  partitioned  into  three  major  categories:  1)  pro¬ 
vider  profiles,  2)  reports  useful  to  management,  and  3)  special  reports  not 
generated  on  a  recurring  basis.  The  number  of  reports  possible  are  limited  only 
by  the  user's  imagination. 

DISCUSSION 

Primary  care  providers  received  profile  reports  of  their  practice  on  a 

monthly  basis.  The  report  included  the  following  information: 

a  list  and  frequency  of  all  primary  diagnoses 

procedures  reported 

patient  demographic  data 

beneficiary  status  of  patients 

number  and  types  of  physical  examinations 

average  time  patients  spent  in  the  clinic 

Monthly  aggregate  reports  useful  to  management  were  prepared  and  included: 

number  of  patients  seen  in  each  clinic 
number  of  forms  completed  by  each  provider 
information  for  the  Medical  Summary  Report 
MED  302  Service  Branch  Total 
MED  302  Service  Branch  Total/Orl  and  TMC 
number  of  exams  chaperoned  per  clinic 
students  from  other  countries 

Twenty  diagnostic  groups  accounted  for  77.9%  of  the  diagnoses  made  during 
November,  7S.4X  in  December,  76.6%  in  February,  78.9%  iri  March,  and  7/.l%  of  the 
total  diagnoses. 

To  address  reliability  of  data  captured,  the  investigators  randomly  selected 
30  encounter  forms  and  compared  the  entries  against  the  outpatient  charts  for 
the  same  encounter.  The  information  on  the  encoui ter  forms  compared  identically 
i n  100%  of  the  cases . 

A  major  concern  of  the  hospital  staff  was  fear  trial  tn<-  total  encounters 
reflected  by  the  study  would  be  fe*er  tnao  trip  M'/.o  based  on  the  Medical  Nummary 
302  count.  If  tne  MsjjA.  »ere  to  use  toe  enccufter  system  to  re[,  iase  a-i 
Medical  lummary  302  feeder  reports,  would  tne  act  v.ty  be  penalised?  During  ’he 
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training  phase  this  concern  was  alleviated,  when  one  clinic,  which  had  counted 
78  patient  encounters  on  the  MED  302  feeder  report  the  previous  day,  ran  their 
encounter  forms  and  found  120  encounters  documented,  an  increase  of  54%.  A 
major  strength  of  the  encounter  system  is  that  the  counts  or  visits  are  comple¬ 
tely  auditable;  i.e.,  charts  can  be  pulled  to  compare  encounter  forms  to  actual 
patient  visits.  This  information  is  more  difficult  to  extract  from  Medical 
Summary  302  feeder  reports. 

Near  the  end  of  the  study  period,  care  providers  and  clerical  personnel 
participated  in  a  survey  to  measure  their  opinions  of  the  encounter  form  and 
suggestions  (additions/deletions)  to  increase  its  effectiveness.  Twenty  per¬ 
cent  of  the  care  providers  indicated  they  would  like  to  receive  the  practice 
profile  report  on  a  continuing  basis.  When  asked  if  use  of  the  form  should  be 
adopted  Army-wide,  21%  said  yes  because  they  felt  the  form  would  give  good 
estimates  of  workload;  68%  said  no,  their  reasons  focused  on  additional  time 
spent  in  filling  out  the  form,  which  caused  them  to  see  two  to  four  less 
patients  per  day;  11%  had  no  opinion.  (Arguments  regarding  fewer  patients  seen 
were  felt  to  be  artifacts  not  validated  by  administrative  data,  i.e.,  clinic 
hours,  backlogs,  or  number  of  patients  seen,  were  not  affected.)  Forty-seven 
percent  felt  information  gained  by  completing  the  form  was  of  value  to  them, 
while  53%  felt  it  was  of  no  value;  58%  felt  it  was  of  value  to  Fox  Army 
Hospital  and  the  Army  while  26%  felt  it  was  not;  and  16%  had  no  opinion.  Only 
10%  said  they  preferred  not  to  use  the  form  at  all.  Most  receptionists 
responded  that  the  form  was  easy  to  use  and  agreed  that  the  elements  were  in 
the  most  logical  sequence;  90%  felt  the  form  captured  all  information  required 
for  report  generation  such  as  the  MED  Summary  Report  302. 

Several  lessons  have  been  learned  from  the  study: 

(1)  No  one  page  form  can  meet  the  needs  of  every  clinic.  It  is  suggested 
that  further  study  be  undertaken  to  develop  several  forms  for  use  by  different 


159 


specialties  (e.g.,  pediatrics,  obstetrics,  occupational  medicine,  walk-in 
clinic,  etc. ) . 

(2)  Several  providers  found  the  ICHPPC-2  diagnostic  codes  to  be  too  general 
for  their  needs.  This  may  be  a  result  of  the  physicians'  experience  with  the 
ICD-9  codes  for  inpatient  diagnoses.  This  area  bears  further  exploration. 

(3)  Time  to  fill  out  encounter  forms  would  be  greatly  decreased  if  a 
registration  system  were  developed  to  hold  the  patients’  basic  demographic 
data  for  call  up . 

(4)  The  need  for  trained  arid  dedicated  personnel  to  manage  the  project  and 
to  process  encounter  forms  is  obvious.  It  is  not  envisioned  that  added  person¬ 
nel  would  be  required,  but  that  a  realignment  of  duties  may  be  necessary  as  the 
system  would  greatly  decrease  the  MED  Summary  302  clerk's  workload. 

SUMMARY/CONCLUSIONS 

The  overall  objectives  of  the  study  have  been  met. 

(1)  The  elements  to  be  collected  for  an  ambulatory  care  database  were 
identified.  A  significant  number  could  be  standardized  across  MTFs;  however, 
the  need  for  site  specific  variables  is  recognized. 

(2)  The  majority  of  care  providers  will  complete  their  portion  of  the 
encounter  form. 

(3)  A  single  encounter  form  for  all  clinics  is  not  acceptable. 

(4)  Data  collected  can  be  audited  and  provide  an  objective  and  valid 
ambulatory  peer  review  and  quality  assurance  mechanism. 

(5)  Provider  and  clerical  staff  satisfaction  was  surveyed. 

(6)  A  comparison  of  encounters  from  the  ACDB  and  the  MED  Summary  302  was 
accomp , i shed . 

(7)  The  number  of  reports  that  can  be  developed  from  the  data  are  limited 
only  by  the  users'  imagination.  The  MED  Surrtnary  302  can  be  captured  from  the 


data  elements. 
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(8)  The  OMR  method  of  data  capture  was  shown  to  be  efficient  and  cost 
effect i ve . 

(9)  Problems  needing  resolution  in  future  use  of  an  ACDB  were  identified. 

(10)  The  need  for  command  emphasis,  at  the  highest  levels,  is  obvious. 
Failure  to  fill  out  the  form  properly  must  be  viewed  as  negatively  as  falsifi¬ 
cation  of  patient  records. 


RECOMMENDATION 

We  recommend  that  this  inexpensive  and  reliable  data  collection  methodology 
be  tested  at  more  sites  for  eventual  implementation.  The  method  is  highly  prac¬ 
tical  because  it  will  interface  with  any  system  or  mainframe  conceptualized  or 
planned  at  th’s  time.  It  provides  an  excellent  interim  system  until  the 
Composite  Health  Care  System  (CHCS)  is  implemented.  Finally,  it  can  continue  to 
be  used  in  areas  where  a  CHCS  is  not  practical  or  planned,  e.g.,  a  field 
env i ronment . 

PLANS 

It  is  the  plan  of  the  Army  as  a  part  of  the  Performance  Measurement  Study  to 
implement  the  methodology  of  the  Ambulatory  Care  Data  Base  Study  to  capture  data 
for  the  ambulatory  portion  of  the  study.  Data  will  be  captured  for  a  twelve 
month  period  of  time  at  six  medical  activities  for  all  outpatient  encounters. 

The  sites  to  be  used  are  still  under  consideration;  however,  they  will  be  chosen 
to  represent  a  variety  of  configurations  (i.e.,  facilities,  types  of  troop  units 
supported  and  with  varying  numbers  of  beneficiaries  supported).  In  addition  to 
providing  data  on  approximately  four  million  encounters,  it  will  afford  the 
opportunity  to  implement  tne  lessons  learned  in  the  Redstone  experience. 

The  study  group  will  continue  to  maintain  a  liaison  with  the  Yale  group  to 
insure  that  the  elements  captured  will  afford  a  comparable  data  set.  At  the 
same  time,  elements  will  be  included  to  integrate  several  other  database  such 


as  the  UCA  data. 


161 


Finally,  since  the  Army's  database  will  eventually  serve  several  purposes, 
provider  oriented  elements  will  also  be  captured  for  such  purposes  as  cer¬ 
tification  and  specialty  board  requirements.  Because  of  these  requirements,  it 
is  mandatory  that  a  diagnostic  coding  system  such  as  ICD-9-CM  be  considered  and 
that  several  encounter  forms  be  developed,  perhaps  as  many  as  one  for  each  spe¬ 
cialty  service.  Work  with  specialty  consultants  is  under  way  at  this  time  to 
make  this  determination. 

Appendix  2  provides  a  conceptual  model  of  the  package  that  is  planned  for 
installation  at  each  study  site  in  addition  to  the  mainframe  support  for  the 
overall  data  analysis. 


Appendix  1 
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THE  DEFECTIVE  MCCU 
AND  ITS  PROMOTION  OF  INEFFICIENCY 

MAJ  Dennis  L.  Clement,  MSC,  USA 

Over  the  past  20  plus  years,  the  Army  Medical  Department  (AMEDD)  has  tied 
its  monetary  wagon  to  the  Medical  Care  Composite  Unit  (MCCU).  Twenty  years  ago 
the  MCCU  was  a  new,  innovative  and,  eventually,  an  accepted  abstract  quantity 
measurement  of  the  medical  care  provided  in  Army  Medical  Treatment  Facilities 
(M7-).  This  composite  work  measurement  formula  incorporated  accepted  elements 
of  hospital  care  (admissions,  beds  occupied,  live  births  and  clinic  visits)  and 
gave  them  a  relative  weight  based  on  resources  consumed  by  each  element. 

The  MCCU  eventually  became  the  bench  mark  or  reference  point.  Each  MTF 
could  compare  itself  against  previous  performances  and  other  similarly  organized 
,r  1  pnysically  structured  MTFs .  This  reference  point  became  a  unit  for  cost 
comparison  and  eventua’ly  the  basis  for  allocation  and  retraction  of  monetary 
resources  for  medical  care  at  all  Army  MTFs.  The  more  admissions,  beds 
occupied,  live  births  and  clinic  visits  an  organization  could  accomplish  --  the 
more  resources  would  be  forth  coming. 

Unfortunately,  the  MCCU  formula  fails  to  incorporate  a  measurement  of  the 
ancillary  services  (i.e.,  radiology,  pathology  and  pharmacy)  utilized  in  the 
treatment  of  patients  and  of  the  manhours  consumed  in  educating  the  care  provi¬ 
des.  It  also  fails  to  incorporate  a  sufficiently  weighted  measurement  of  hours 
..\,n vimed  in  educating  potential  patients  in  preventive  care  measures.  As  a 
result  of  the  above  failures,  the  MCCU  provides  no  incentive  to  provide  the 
most  effective  and  efficient  care  to  the  patients.  The  emphasis  of  the  MCCU  is 
on  a  partial  list  of  work  units,  not  on  the  end  products.  The  MCCU  formula  was 
ba^eo  cl  the  assumptions  that  objective  diagnostic  testing  would  not  advance, 
that  cars-  providers  would  not  fall  victim  to  preservation  of  their  resource 
base,  and  that  the  health  care  system  would  not  change  focus. 


Over  the  past  23  year;.,  *he  he  a '  th  care  system  has  changed  its  focus  from 
providing  primarily  inpatient  care  to  emphasizing  preventive  care  and  out¬ 
patient  treatment.  Trie  force  for  this  change  iri  focus  was  derived  from  the  fac 
c  -  *  t  preventive  car?  and/:--  outpatient  treatment  was  much  less  costly  and  much 
more  effective  and  efficient  than  providing  inpatient  health  care.  Outpatient 
•;  are  precludes  costly  cursing  care,  wh i i e  providing  positive  psychological 
incentives  for  *  he  patients  to  recover  and  return/cont inue  to  be  a  productive 
member  of  society. 

In  the  effort  to  provide  more  unattended  specific  care  to  the  patient  on  an 
outpatient  basis,  more  objective  diagnostic  tests  were  developed,  a,  well  as, 
more  effective  and  safer  pnarmaceut icai  products.  As  a  result  of  the  emphasis 
on  outpatient  care  and  the  normal  advancement  of  medical  science,  we  have  wit¬ 
nessed  a  quantum  leap  in  types  and  sophistication  of  diagnostic  tests  and  phar¬ 
maceutical  items  provided  our  medical  staff.  As  a  result,  our  medical  staffs 
are  demonstrating  a  greater  diagnostic  dependence  on  objective  testing.  Over 
the  past  10  years,  the  Fort  Knox  MEDDAC  Pathology  Service  has  increased  its 
testing  capability  by  approximately  25  additional  tests;  the  Radiology  Service 
has  incorporated  advanced  radiological  techniques  which  were  not  available  10 
years  age  (Table  1);  while  the  pharmaceutical  utilization  has  burgeoned  with  a 
more  than  mildly  esi.alat.ing  cost  per  pharmacy  procedure. 

This  inc^ase  in  the  availability  of  new  objective  diagnostic  testing  in 
-rdio'oqy  and  pathology  has  resulted  in  the  care  provider's  increased  use  of 
such  procedures  per  patient.  Increased  procedures  per  patient  has  resulted  in 
an  increase  in  cost  of  treatment  per  patient.  The  MCCU  measurement  does  not 
reflect  the  additional  work  being  accompl i sited  within  a  medical  treatment 
facility  as  a  result  of  the  increased  use  of  diagnostic  procedures. 

As  a  matter  of  fart,  in  reviewing  the  pathology  procedures  accomplished  at 
Fort,  Knox  MtDOAu  over  the  past  eight  years,  we  find  a  dramatic  procedure 
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Table  1 


RADIOLOGY  DIAGNOSTIC  TESTS 


PROCEDURES  PROJECTED  ANNUAL 

REQUIREMENTS 

Routine  Exams 

Head  Neck 

5,000 

Chest  Ribs 

22,000 

Abdomen 

5,000 

Spine  Pelvis 

3,400 

Extremities  and  Pelvimetries 

11,000 

Fluoroscopy  Exams 

Upper  GI 

1,800 

Barium  Enema 

720 

Small  Bowel  1  Fol low  Through 

65 

Other  (Gall  Bladders,  Pacemaker 

105 

Insertions ) 

Special  Procedures 

IV  Pyelograms 

720 

Operative  Cholangiogram 

60 

Tomogram 

240 

T-Tube  Cholangiogram 

20 

Arthogram 

50 

Histosalpingogram 

40 

Angiography 

30 

Voiding  Cystourethrograms 

100 

Venogram 

30 

Myelography 

30 

Other  (Sinogram,  Kidney,  Cyst,  Retrograde, 

50 

Laryngogram,  Lymphangi gram. 

Bronchogram,  or  other  special 

procedures  as  requested  by  staff 

physicians) 

New  Tests 

Ultrasound 

2,340 

Xeromammogram 

900 

CT  Scan 

500 

Linear  Accelerater 

decrease  in  1980  with  an  increase  in  MCCUs.  What  happened?  In  1980  automa¬ 
tion  was  realized  in  the  laboratory.  New  equipment  reduced  the  CAP  pathology 
procedure  count.  The  automated  equipment  performed  more  tests  in  less  time 
and  with  less  cost  than  the  manual  system.  The  fact  that  more  procedures  were 


170 


accomplished  without  a  commensurate  increase  in  cost  per  MCCU,  could  lead  one 
to  believe  the  MCCU  is  an  adequate  measurement  for  distributing  resources. 

Such  is  not  the  case.  The  adequacy  of  funding  for  the  future  years  occurred 
in  spite  of  the  MCCU  formula.  The  adequacy  of  funding  occurred  as  a  result  of 
an  equipment  change  --  a  quirk  of  fate  --  rather  than  the  adequacy  of  the  work 
measurement  unit  or  any  trends  it  may  have  indicated. 

A  review  of  average  daily  pathology  procedures,  identified  in  Figure  1, 
indicate  procedures  are  increasing  as  the  average  daily  MCCUs,  displayed  in 
Figure  2,  are  decreasing.  This  review  would  indicate  that  the  quantity  of 
proceaures  performed  per  patient  is  increasing.  Carried  one  step  further,  we 
should  conclude  that  without  additional  technical  advancement  in  pathological 
testing  equipment  economies,  we  can  expect  the  trend  toward  more  objective 
diagnosis  to  increase  patient  treatment  costs  significantly. 

In  the  area  of  radiology  service  we  find  a  similar  situation  (See  Figure 
3).  New  applications  of  old  technology  and  new  technologies  (e.g.,  ultra  sound 
and  Computer  Aided  Tomography  scans)  have  resulted  in  more  expensive  tests  and 
in  many  cases  more  tests  per  patient.  The  MCCU,  however,  does  not  incorporate 
these  diagnostic  test  procedures  in  its  composite  formula.  As  a  result,  the 
use  of  new  radiological  tests  penalizes  the  MTF  by  consuming  more  resources 
than  provided  by  the  MCCU  justification.  In  effect  the  MTF  reduces  their 
fut  ire  available  resources  if  the  use  of  the  new  radiology  services  reduces 
treatment  time  for  inpatients  or  precludes  future  outpatient  visits,  which 
would  in  turn  justify  future  resources.  (Additional  outpatient  visits  or  inpa¬ 
tient  days  would  have  increased  MCCUs,  thereby  justifing  more  future 
resources . ) 

Of  all  the  increases  in  ancillary  services  the  pharmacy  procedure  has 
displayed  the  most  dramatic  increase  at  Fcrt  Knox  MEDDAC  (see  Figure  4).  From 
1970  to  1984  pharmacy  procedures  have  increased  from  1.70  to  3.41  procedures 


per  MCCU.  Has  the  increase  in  use  of  pharmaceuticals  been  beneficial  to  the 
patient?  Did  it  reduce  recovery  time?  Did  this  increase  benefit  anyone?  It 
is  impossible  to  determine  by  evaluating  the  MCCU.  The  only  certain  deter¬ 
mination  is  that  the  consumption  rate  of  resources  has  increased.  New  high 
tech  pharmaceuticals  are  more  expensive  than  they  were  10  years  ago  and  the 
number  of  pharmacy  procedures  per  MCCUs  has  increased. 

As  a  result  of  the  MCCU's  failure  to  incorporate  the  effect  of  the  evolu¬ 
tion  from  subjective  to  objective  diagnosis,  the  care  providers  have  become 
aware  of  the  significance  of  outpatient  treatment  versus  inpatient  treatment. 
The  care  providers  have  become  acutely  aware  that  an  inpatient  visit  will 
generate  on  the  average  14-15  MCCUs  compared  to  an  outpatient  visit  which 
equates  to  .3  MCCU.  The  provider  of  care  is  quick  to  note  that  if  extensive 
or  sophisticated  (expensive)  diagnostic  testing  is  required,  it  makes  sense  to 
admit  the  patient  rather  than  provide  the  tests  and  care  on  an  outpatient 
basis.  The  admission  would  justify  more  future  resources  (Ft  Knox  cost  of 
$82.31  per  AD  MCCU  i.e.  $24.69  per  outpatient  vs  $1152.34  per  inpatient). 
Examples  of  diagnostic  tests  and  treatments  which  can  be  performed  on  an  out¬ 
patient  basis  but  were  at  one  time  performed  on  an  inpatient  basis  are  found 
in  T able  2. 

The  care  provider's  knowledge  of  the  effect  of  MCCU  production  on  the 
availability  of  resources  may  not  cause  conscious  admission  of  a  true  out¬ 
patient  by  a  care  provider.  But,  at  a  minimum,  it  will  persuade  the  care  pro¬ 
vider  to  err  to  his  advantage.  The  net  effect  of  admitting  a  patient  who 
could  remain  an  outpatient  costs  the  government  resources  that  were  not  a 
necessary  expenditure  (food,  nursing  staff,  military  training,  or  service 
time,  etc. ). 

The  failure  of  the  MCCU  formula  to  incorporate  in  some  fashion  the  health 
care  provider's  manhours  expended  on  continuing  health  education  (CHE)  distort 
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Table  2 

EXAMPLES  OF  OUTPATIENT/INPATIENT 
DIAGNOSTIC  TESTS  AND  TREATMENT 


1.  Internal  Medicine  Clinic 

a.  Protoscopy 

b.  Upper  GI 

c.  Barium  Enema 

d.  Endoscopy 

e.  Colonoscopy 

2.  Ortho  Clinic 

a.  Nerve  Block 

b.  Mylograms 

c.  Arthroscopy 

3.  Urology  Clinic 

a.  Vasectomy  Treatment 

b.  Circumcision  Treatment 

c.  Stone  Basket 

d.  Plyograms 

4.  Opthomology  Clinic 

a.  Laser  Optics  Treatment 

5.  OB/GYN  Clinic 

a.  D&C  Treatment 

b.  NST  (None  Stress  Test) 

c.  OCT  (Oxitocen  Challenge  Test) 

d.  Scope  Tubal 


the  measurement  of  care  provided.  The  care  provider  has  two  primary  tasks  to 
perform:  treating  patients  and  upgrading  his  knowledge  of  the  most  effective  and 
efficient  treatment  methods  and  modalities.  Failure  to  acquire  this  additional 
knowledge  will  eventually  effect  the  quality  and  quantity  of  the  care  he  pro¬ 
vides. 

Acquiring  CHE  knowledge  is  in  direct  conflict  with  treating  patients.  When 
the  care  provider  is  receiving  this  training  he  is  not  treating  patients.  He  is 
a  resource  provided  to  treat  patients,  but  is  not  doing  so.  In  fact,  the  pre- 
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sent  system  of  distributing  resources  based  on  the  MCCUs  penalizes  the  MTF  for 
allowing  the  care  provider  time  off  from  treating  patients  to  receive  CHE.  The 
average  FY  83  CHE  physician  or  physician  assistant  training  experience  removed 
the  practitioner  from  the  direct  care  work  force  for  a  total  of  six  days  each. 
Six  days  of  patient  treatment  for  each  of  our  66  practitioners  were  lost  --  over 
one  year  of  direct  patient  care! 

An  argument  against  including  a  weighted  measurement  for  ancillary  services 
and  care  provider  education  in  the  formulation  of  the  MCCU  could  hinge  on  the 
fact  that  these  areas  are  supportive  of  patient  treatment  and  are  not  direct 
contact  situations.  Whereas,  all  the  elements  presently  included  in  the  MCCU 
formulation  are  direct  contact  situtations.  The  inclusion  of  these  supportive 
services  would  cause  the  care  provider  to  use  diagnostic  testing  excessively, 
over  prescribe  pharmaceuticals,  and  spend  inordinate  amounts  of  time  receiving 
additional  training  rather  than  treating  patients. 

If  the  care  provider  stood  to  personally  gain  by  over  testing  and/or  over 
prescribing,  the  argument  may  be  valid.  However,  military  care  providers  gain 
nothing  personally.  The  only  entity  that  could  gain  by  excessive  use  of  suppor¬ 
tive  services  would  be  the  organization.  The  incentive/pressure  to  do  so  would 
be  far  less  than  the  present  pressure  to  admit  every  patient  possible.  The  pre¬ 
sent  pressure  is  caused  by  the  lack  of  sufficient  resources  to  support  true 
patient  requirements  and  the  associated  required  diagnostic  testing  and  phar¬ 
maceutical  items.  If  correct  and  adequate  resources  were  identified  and  pro¬ 
vided  by  the  MCCU  resource  justificaion  system,  less  manipulation  of  work  units 
would  be  the  order  of  the  day.  The  excessive  education  of  care  providers  gained 
by  sacrificing  patient  treatment  time  could  be  prevented  by  limiting  the  number 
of  hours  or  days  of  CHE  training  time  a  care  provider  could  have  per  year. 

The  effect  of  failing  to  sufficiently  weight  the  event  of  educating  poten¬ 
tial  patients  in  preventive  care  defeats  the  concept  of  preventive  care. 
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Preventive  care  education  is  based  on  the  premise  that  it  is  much  more  effi¬ 
cient  to  prevent  injury  or  disease  by  educating  high  risk  persons,  rather  than 
treating  the  injury  or  disease.  The  MCCU  formula  provides  a  .3  weight  to  a 
preventive  care  session  outside  of  the  MTF  regardless  of  the  size  or  the  dura¬ 
tion  of  the  class. 

At  the  Fort  Knox  MEDDAC  during  May  1984,  2,894  potential  patients  were  pro¬ 
vided  Fitness  &  Injury  Prevention  Classes  in  41  presentations.  Credit  for  the 
work  accomplished  was  12.3  MCCU.  Each  presentation  was  approximately  50  minutes 
long,  not  including  the  care  provider's  travel  time.  If  the  care  provider  had 
remained  in  the  MTF  and  treated  one  patient  every  40  minutes,  he  would  have 
surely  achieved  more  MCCUs  for  the  same  period  of  time,  especially  if  a  few  of 
the  patients  were  admitted. 

The  failure  of  the  MCCU  to  incorporate  adequate  measurement  of  ancillary 
services  consumed,  care  provider  education  experienced,  and  preventive  care 
classes  conducted,  highlights  the  severe  limitation  of  the  MCCU  measurement 
unit.  The  measurement  unit  must  at  least  count  those  fuctions  that  are  con¬ 
tinually  changing  (e.g.,  radiology)  and  promote/support  the  most  efficient 
concept  of  providing  health  care.  At  its  best,  the  MCCU  is  a  gross, 
incomplete,  and  misleading  measurement  of  work  accomplished  in  a  MTF.  In  a 
nutshell,  the  MCCU  does  not  promote  efficient  or  effective  use  of  resources. 

The  MCCU  as  it  is  used  in  the  distribution  of  resources  promotes  excessive  and 
wasteful  use  of  resources.  A  more  effective  work  measurement/work  costing 
system  is  direly  needed  if  the  MTFs  are  to  become  effective  and  efficient. 


NOTE:  Workload  data  upon  which  this  presentation  was  based  is  shown  in  Table  3. 


Table 


Services  Command,  Command  Performance  Summary 
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NURSING  PRODUCTIVITY  AS  MEASURED  BY  THE 
NAVY  WORKLOAD  MANAGEMENT  SYSTEM 

CDR  Karen  A.  Rieder,  NC ,  USN 

I ntroduction 

One  of  the  most  difficult  problems  facing  directors  of  nursing  service  is 
the  determination  of  nurse  staffing  requirements.  Traditionally,  staffing  needs 
were  based  upon  daily  inpatient  census,  and  staffing  standards  were  set  on  a 
ratio  of  hours  of  care  per  24  hour  patient  day.  For  example,  3.5  hours  of  care 
per  patient  day  may  oe  used  with  staffing  levels  generally  set  high  to  ensure 
adequate  coverage  during  peak  loads  (DHEW,  1973).  Another  major  shortcoming  was 
the  assumption  that  nursing  care  requirements  would  be  the  same  for  every 
patient.  The  result  was  a  continuous  imbalance  of  workload.  Although  super¬ 
visors  did  make  daily  staffing  reassignments  based  upon  their  judgement  and 
experience,  these  changes  were  viewed  as  subjective  and  frequently  resulted  in 
charges  of  unfairness  or  favoritism.  Therefore,  studies  were  undertaken  to  find 
a  more  objective  method  of  determining  staffing  requirements.  One  technique 
that  is  widely  used  today  is  a  Patient  Classification  System  (PCS).  Under  this 
method,  staffing  requirements  are  based  upon  the  acuity  levels  of  patients.  A 
Patient  Classification  System  refers  to  the  "identification  and  classification 
of  patients  into  care  groups  or  categories,  and  to  the  quantification  of  these 
categories  as  a  measure  of  the  nursing  effort  required"  (Giovenetti,  1979,  p.4). 
Abdeilah  and  Levine  (1979)  have  identified  two  types  of  Patient  Classification 
Systems.  The  first  type,  "prototype  evaluation,"  matches  patients  to  a  standard 
profile  or  stereotype  for  each  category.  The  second  and  most  common  type, 
"factor  evaluation,"  utilizes  critical  indicators  or  descriptors  of  direct  care 
requirements  to  separately  rate  the  care  requirements  for  each  patient.  Each 
factor  earns  a  score.  The  scores  are  then  summed  with  the  total  score 
designating  a  patient's  category. 
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Patient.  Classification  Systems  were  not  widely  implemented  in  hospitals 

until  the  Joint  Commission  on  Accreditation  of  Hospitals  revised  Nursing  Service 

Standard  III  in  1980.  The  standard  states  that: 

The  nursing  department/service  shall  define,  implement, 
and  maintain  a  system  for  determining  patient  require¬ 
ments  for  nursing  care  on  the  basis  of  demonstrated 
patient  needs,  appropriate  nursing  intervention,  and 
priority  of  care.  (JCAH,  1980,  p.  118) 

With  this  mandate,  Patient  Classification  Systems  have  gained  popularity. 

T h e_  N av y  Wo r k lo ad  Management  System 

The  development  of  the  Navy  Workload  Management  System  (WMS)  which  includes 
a  patient  classification  tool  was  initially  undertaken  at  the  request  of  the 
Navy  Surgeon  General  (SGO,  1979)  and  reinforced  by  the  mandate  from  the  Joint 
Commission  on  the  Accreditation  of  Hospitals.  This  development  was  part  of  an 
ongoing  three  year  research  project  which  examined  the  utility  of  a  Patient 
Classification  System  as  the  basis  for  nurse  staffing  decisions  within  the  Navy 
Medical  Department.  What  evolved  was  a  WMS  for  nursing  which  is  designed  to 
accurately  determine  staffing  requirements  based  on  inpatient  nursing  workload. 
This  system  provides  a  useful  means  for  establishing  appropriate  baseline  staff 
and  for  allocating  personnel  to  meet  daily  fluctuations  in  workload. 

Figure  1  delineates  how  the  Workload  Management  System  operates.  The  pro¬ 
cess  begins  with  the  classification  of  patients  into  categories  of  care.  The 
hours  of  nursing  care  required  and  the  recommended  number  and  mix  of  personnel 
needed  to  meet  these  requirements  are  then  calculated  based  on  the  number  of 
patients  in  each  category.  The  actual  number  and  mix  of  pet sonnel  assigned  is 
then  compared  with  the  recommended  staffing  to  determine  if  staffing  levels  are 
above,  below,  or  within  the  recommendations.  If  staffing  levels  for  the 
workload  to  be  accomplished  differ  from  recommended  levels,  staffing  can  be 
adjusted  to  balance  the  variation. 
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Figure  1 

DYNAMICS  OF  THE  WORKLOAD  MANAGEMENT  SYSTEM 


In  short,  the  Workload  Management  System  encompasses  an  integrated  process 
of:  (1)  assessment  and  classification  of  patients;  (2)  allocation,  assignment 
and  scheduling  of  nursing  personnel;  and,  (3)  an  evaluation  or  monitoring  of 
care  given. 

In  order  to  give  a  clear  understanding  of  the  workload  management  process, 
it  is  essential  to  review  both  the  classification  and  staffing  phases  of  the 
system.  This  can  be  done  by  defining  the  terms  which  form  the  foundation  of  the 
system  and  describing  the  reports  used. 

Patient  Classification  of  Direct  Care  Activities 

The  system  begins  with  the  classification  of  all  patients  on  a  daily  basis. 
This  is  the  grouping  of  patients  according  to  some  assessment  of  their  nursing 
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care  requirements  over  a  specified  period  of  time  for  the  purpose  of  determining 
the  number  of  nurses  required  to  provide  that  care.  This  grouping  is  done  with 
a  tool  which  delineates  critical  indicators  of  care,  those  activities  that  have 
the  greatest  impact  on  direct  care  time. 

The  Critical  Indicator  Guidesheet  is  the  basis  of  the  Patient  Classification 
System.  This  guidesheet  lists  multiple  patient  care  activities,  each  with  an 
associated  point  value  that  indicates  the  amount  of  time  needed  to  perform  the 
activity.  Each  point  is  worth  7J  minutes  of  direct  nursing  care  time.  Nurses 
use  this  guidesheet  as  a  tool  to  classify  each  patient  on  a  unit.  The  Critical 
Indicator  Sheet  is  divided  into  factors  which  are  a  group  of  critical  indicators 
that  cover  one  specific  domain  of  activities.  They  include  nine  areas:  vital 
signs,  monitoring,  activities  of  daily  living,  feeding,  treatments,  respiratory 
therapy,  IV  therapy,  teaching/emotional  support,  and  continuous  care.  All  of 
these  are  direct  care  activities  which  are  defined  as  those  actions  that  take 
place  in  the  presence  of  the  patient  and/or  family.  The  activities  are  obser¬ 
vable,  behavioral,  and  include  the  following:  placement  of  equipment  at  bed¬ 
side,  explanation  of  procedure  to  patient,  preparation  of  patient,  performance 
of  task,  removal  of  equipment  from  area,  recording,  assessment/observation,  and 
teaching.  In  the  current  system  these  direct  care  activities  are  calibrated  as 
taking  35  percent  of  the  total  nursing  care  time. 

The  specific  number  of  points  assigned  to  each  activity  has  been  based  upon 
documented  time  and  motion  studies.  To  standardize  the  point  value  for  each 
critical  indicator,  the  Navy  utilized  the  results  obtained  in  a  Nursing  Care 
Hour  Standards  Study  completed  by  the  Army  in  1981.  LTC  Susan  Sherrod,  a  nurse 
researcher  at  the  Health  Care  Studies  Division  of  the  Academy  of  Health 
Sciences,  conducted  a  four-year  time  and  motion  study  in  which  37,000  obser¬ 
vations  were  rade  to  derive  mean  times  for  357  direct  nursing  care  activities  at 
nine  hospitals  for  all  levels  of  nursing  staff.  These  time  measurements  were 


utilized  to  determine  minimal  essential  mean  tasking  times  for  all  specified 
direct  nursing  care  activities.  In  addition,  a  methodology  for  determining  care 
provider  numbers  and  mix  for  various  specialty  areas  was  developed. 

In  adopting  the  time  values,  an  assumption  was  made  that  direct  nursing  care 
given  by  Navy  personnel  would  require  the  same  amount  of  time  as  with  Army  per¬ 
sonnel.  This  assumption  was  tested  for  several  activities  and  found  to  be 
valid.  Since  the  Sherrod  study  has  been  cited  as  having  the  most  comprehensive 
and  best  documented  task  list  to  be  found  anywhere  in  nursing  literature  (US 
Army,  1982),  the  Navy  has  a  high  level  of  confidence  in  using  the  point  values 
assigned  to  each  critical  indicator. 

The  operational  definitions  for  these  critical  indicator  activities  have 

been  developed  and  published  in  the  form  of  guidelines.  These  guidelines  have 

been  incorporated  into  an  educational  workbook  that  is  used  as  an  interpretive 

text  for  the  definition  of  each  critical  indicator  and  its  application  during 

the  direct  care  of  patients.  For  example,  one  of  the  activities  on  the  critical 

indicator  sheet  is  "blood  pressure,  manual".  The  operational  definition  states: 

includes  time  to  place  equipment  at  bedside,  place  cuff 
around  extremity,  position  stethoscope,  measure  blood 
pressure,  remove  cuff,  record  results,  and  remove  equip¬ 
ment  from  area. 

Based  on  the  number  of  activities  and  the  total  amount  of  points  assigned 
for  each  critical  indicator  factor,  a  patient  can  fall  into  one  of  six  cate¬ 
gories  of  care.  Category  is  defined  as  the  representative  grouping  of  patients 
according  to  their  nursing  care  requirements.  A  Category  I  patient  requires 
minimal  care  while  a  Category  VI  requires  intensive  care. 

Therefore,  a  nurse  classifies  each  patient  using  a  critical  indicator  sheet 
and  the  results  are  tabulated  on  the  Patient  Classification  Worksheet.  This 
worksheet  lists  each  patient  on  a  unit,  the  total  points  or  time  their  nursing 
care  requires,  and  the  category  of  care  based  on  the  points  awarded  to  the 


patient.  This  form  also  provides  a  summary  of  the  number  of  patients  in  each 
category  on  that  nursing  unit.  The  summary  from  this  worksheet  becomes  the 
working  basis  for  determining  staffing  requirements. 

Accounting  for  Indirect  Care  Activities 

Since  the  Patient  Classification  System  only  addresses  direct  care  activi¬ 
ties,  a  mechanism  has  been  incorporated  into  the  system  for  indirect  care  activ 
ities.  Indirect  care  is  defined  as  all  nursing  activities  done  away  from  the 
patient  in  preparation  for  or  completion  of  care,  as  well  as  those  activities 
directed  toward  general  unit  management.  To  address  these  factors,  indirect 
care  time  and  special  allowances  have  been  incorporated  into  the  nursing  care 
hour  requirements  for  each  of  the  six  patient  care  categories.  Indirect  care 
time  takes  approximately  55  to  65  percent  of  the  total  nursing  care  time. 
Indirect  care  time  is  subdivided  into  the  following  four  categories:  indirect 
care  -  30%;  unpredicted  needs  -  15%;  teaching  hospital  allowance  -  10%;  and 
semi -private  room  allowance  -  20%. 

In  summary,  the  nursing  care  hour  requirements  determined  by  using  this 
system  include  direct  care  time,  indirect  care  time  and  special  allowances  in 
order  to  account  for  the  significant  variations  between  patient  care  activities, 
nursing  units  and  medical  facilities.  The  next  step  is  to  translate  these 
activities  into  staffing  numbers. 

Staffing  Methodology 

The  patient  care  coordinator  for  each  clinical  area  receives  the  Patient 
Classification  Worksheet  and  reviews  it  for  accuracy.  In  order  to  ascertain  the 
amount  of  nursing  care  required  for  a  specific  group  of  patients  who  may  run  the 
continuum  from  Category  I  to  VI,  the  supervisor  calculates  the  nursing  care  hour 
requirements.  This  consists  of  calculating  the  number  of  hours  of  nursing  care 
time  required  for  each  category  of  patient  based  upon  an  assessment  of  their 
direct  and  indirect  nursing  care  requirements.  It  is  operationalized  via  six 


A  .  ’ . 


185 


pre-calculated  Nursing  Care  Hour  Requirement  Charts  which  incorporate  two  fac¬ 
tors:  type  of  unit  (open,  semi -private  room,  nursery,  or  light  care),  and  type 
of  facility  (teaching  vs.  non-teaching  hospitals). 

These  nursing  care  hour  requirements  are  then  translated  into  personnel 
requirements  using  an  eight  hour  workshift  pet  employee  as  the  standard. 
Personnel  requirements  are  the  number  and  mix  of  RNs  and  NRNs  required  to  care 
for  the  patient  workload  on  a  unit.  This  is  operationalized  via  two  charts: 
acute  care  and  intensive  care.  The  acute  care  chart  allocates  a  40%  RN  to  60% 
NRN  personnel  mix  and  distributes  45%  of  this  staff  to  the  day  shift,  35%  to  the 
evening  shift,  and  20%  to  the  night  shift.  In  contrast,  the  intensive  care 
chart  utilizes  a  60%  RN  to  40%  NRN  personnel  mix  which  is  evenly  distributed 
across  each  shift. 

The  classification  data  and  the  recommended  number  and  mix  of  personnel  are 
recorded  on  the  daily  summary  sheet.  The  Workload  Management  Summary  Sheet 
identifies  on  a  daily  basis  the  number  of  patients  in  each  category  on  a  nursing 
unit,  the  recommended  staffing  for  each  shift,  the  actual  staff  scheduled  on 
each  shift,  and  any  variance  between  the  recommended  and  actual  staffing.  Any 
changes  to  the  schedule  are  recorded  on  this  form  so  that  the  number  of  person¬ 
nel  who  actually  worked  on  each  shift  is  documented. 

The  last  instrument  used  is  the  Monthly  Staffing  Summary  Graph.  This  is 
designed  as  a  tool  to  identify  trends  in  workload  and  staffing  distribution 
problems  on  units  for  each  shift.  The  graphic  display  will  document  the  recom¬ 
mended  daily  requirements  as  compared  to  the  actual  scheduled  staff  and  what  the 
staffing  looked  like  after  changes  have  been  made. 

Rel iabi 1 ity  Testing 

In  order  for  the  patient  classification  process  to  generate  accurate  and 
useable  information,  an  inter-rater  reliability  monitoring  system  must  be  used 


on  a  regular  basis.  Inter-rater  reliability  refers  to  the  consistency  or  stabi¬ 
lity  of  measurement  of  the  patient  classification  instrument  from  user  to  user. 
Reliability  is  evaluated  by  having  two  individuals  classify  the  same  patient 
independently  on  Patient  Classification  Worksheets.  This  information  is  then 
used  to  compute  an  index  of  equivalence  or  agreement  by  factors  and  by  category 
between  classifiers. 

The  purpose  of  the  reliability  testing  is:  (1)  to  measure  the  Percentage  of 
agreement  among  nurses  in  selecting  patient  classification  categories  and  fac¬ 
tors;  (2)  to  identify  the  need  for  updating  classification  skills  and/or 
revising  patient  classification  categories  and  decision  rules;  and,  (3)  to 
routinely  monitor  the  patient  classification  process  to  assure  that  all  nursing 
personnel  continue  to  use  the  process  in  the  manner  intended.  In  order  to  do 
this  testing  an  independent,  expert  classifier  appointed  by  nursing  adminis¬ 
tration  must  conduct  the  reliability  testing.  It  should  be  done  routinely 
(approximately  monthly)  on  all  nursing  units  involved  in  patient  classification 
and  efforts  made  to  maintain  a  minimum  of  8056  agreement  in  patient  categories 
and  factors  among  classifiers.  If  the  percent  of  agreement  by  category  is  below 
80%,  efforts  should  focus  on  discussions  with  unit  classifiers  to  determine  the 
reasons  for  disagreement.  Corrective  action  must  be  taken  to  increase  inter¬ 
rater  reliability. 

To  reiterate,  ongoing  reliability  testing  is  crucial  to  validating  the 
accuracy  of  the  data  forwarded  from  each  facility.  When  one  considers  that, 
based  on  this  information,  major  decisions  will  be  made  at  both  the  hospital  and 
headquarters  levels,  the  importance  of  this  process  is  self-evident. 

Criteria  for  System  Performance 

In  order  for  the  WMS  to  perform  effectively  the  following  system  perfor¬ 
mance  criteria  should  be  met: 
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1.  Comprehensiveness  -  The  system  should  classify  inpatients  according  to 
levels  of  required  nursing  care  and  determine  the  amount  of  nursing  time  re¬ 
quired  to  care  for  these  patients.  It  should  apply  to  all  inpatients  at  all 
Navy  hospitals  and  should  account  for  both  direct  and  indirect  care.  In  this 
way,  the  system  should  produce  a  series  of  daily  and  monthly  reports  that  are 
useful  both  in  the  operation  of  the  hospital  and  in  the  planning  and  budgeting 
of  resources.  The  reports  should  be  timely  and  provide  information  on  actual 
patient  days,  actual  nurse  staffing,  and  nurse  workload  by  patient  category,  by 
shift,  by  ward,  and  by  personnel  category. 

In  order  to  do  this,  that  same  definition  of  patient  classes  should  be  used 
throughout  the  Navy.  The  definition  should  he  simple  to  understand  and  the  pro¬ 
cedures  for  using  the  tool  speedy  and  reliable. 

2.  Val idity  -  The  system  should  also  measure  what  it  purports  to  measure. 
The  validity  of  the  patient  categories  and  the  times  for  direct  care  should  be 
assessed  on  a  Navy-wide  basis  using  expert  opinions  and  objective  data. 

3.  Re  1 i ab i 1 ity  -  Unsystematic  variation  between  raters,  between  hospitals 
and  wards,  and  overtime,  must  be  maintained  at  a  reasonable  level.  The  relia¬ 
bility  of  the  Workload  Management  System  should  be  assessed  at  each  hospital 
monthly.  To  ensure  this,  an  implementation  plan  should  include  training  and 
orientation  for  all  personnel,  as  well  as  assignment  of  responsibility  for 
various  phases  of  the  system  at  each  hospital. 

4.  Usefulness  -  The  Workload  Management  System  should  be  a  valid  management 
tool  for  identifying  patient  workload  and  assigning  appropriate  staff.  It 
should  retrospectively  document  what  has  been  done  for  the  patient  and  prospec¬ 
tively  identify  what  nursing  care  will  be  needed  for  the  following  shifts. 

To  complete  the  Workload  Management  System  Project,  an  evaluative  study  was 
begun  in  February  1984  to  assess  its  validity  and  reliability  (Rieder,  1983). 
Over  a  period  of  six  months,  on-site  visits  were  planned  to  six  CONUS  hospitals 
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randomly  selected  based  upon  size.  Visits  to  five  sites  have  thus  far  been 
completed.  Interrater  reliability  was  tested  by  comparing  scores  awarded  by  a 
nurse  expert  to  scores  assigned  by  the  nurse  classifying  patients  on  the  same 
nursing  units.  Initial  reliability  results  have  been  greater  than  .80.  Other 
data  was  collected  by  administering  questionnaires  which  measure  charge  nurses' 
perceptions  of  staffing  adequacy  and  staff  nurses'  perceptions  of  direct  and 
indirect  nursing  care  given  under  various  staffing  patterns. 

Throughout  this  project,  close  collaboration  has  been  maintained  with  the 
Army  Nursing  Research  Department,  Walter  Reed  Army  Medical  Center,  Washington, 

D.  C.  The  Army  has  field  tested  the  WMS  at  five  Army  Military  Treatment 
Facilities.  Based  on  their  findings,  the  WMS  will  be  implemented  at  all  facil¬ 
ities  in  October  1984.  In  addition,  the  Army  has  recently  completed  a  work 
sampling  study  of  indirect  nursing  care  at  nine  hospitals  (U.  S.  Army,  1983). 
Results  from  this  study  will  be  valuable  in  determining  if  the  Navy  Nursing  Care 
Hour  Requirement  Charts  need  to  be  revised. 

Benefits  of  the  System 

The  benefits  to  the  Navy  of  a  reliable  Workload  Management  System  are 
multiple.  At  the  local  hospital  level,  the  information  can  be  used  to  adjust 
staff  assignments,  to  compare  workload  on  different  units,  and  to  justify 
requests  for  numbers  and  skill  level  of  nursing  personnel.  The  Naval  Medical 
Command  could  use  the  information  for  facility  planning  of  beds  and  units,  for 
health  care  program  planning,  and  for  identifying  trends  in  patient  needs  based 
on  seasonal,  geographical  and  historical  data.  The  Navy  Military  Personnel 
Command  could  utilize  the  data  for  allocating  nursing  billets  between  hospitals. 
This  data  could  also  he  valuable  as  an  addendum  to  the  Uniform  Staffing 
Methodology  and  Uniform  Chart  of  Accounts  information  since  complexity  of 
patient  care  requirements  within  work  centers  is  pertinent  to  measuring, 
justifying,  and  allocating  resources. 
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Summary 

In  summary,  nurse  staffing  in  the  past  has  been  based  on  historical  data 
related  to  the  number  of  beds  occupied  in  a  clinical  area.  Over  the  years,  the 
nature  and  volume  of  our  workload  has  been  significantly  altered  by  increasingly 
complex  technology,  specialization,  emphasis  on  health  teaching,  personalization 
of  service  to  patients  and  onqoing  evaluation  of  personnel  performance  and 
patient  care.  No  longer  can  staffing  be  managed  on  the  basis  of  patient  census 
alone,  and  we  have  the  added  impetus  of  the  JCAH  1980  Nursing  Standard.  To  this 
end,  the  Navy  Medical  Department  has  developed  a  valid  and  reliable  system  that 
enables  patients  to  be  categorized  according  to  required  nursing  care  and  pro¬ 
vides  guidelines  for  effective  allocation  and  utilization  of  nursing  resources. 
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HOMOGENEITY  OF  DRGs  FOR  NAVY  INPATIENT  DATA: 

A  Critical  Analysis 

Terrence  L.  Kay 

Our  interest  in  case  mix  methodology  resulted  from  a  desire  to  use  DRGs  as  a 
tool  to  account  for  differences  in  average  length  of  patient  stay  among  naval 
hospitals.  The  initial  approach,  which  began  shortly  after  Yale  rr'  id  the 
ICD9-CM  DRGs,  was  to  compare  DRGs  with  other  patient  grouping  *  piques  that 
were  commonly  used  by  the  Navy,  such  as  patient  categories  based  n  e  diagno¬ 
sis  code. 

We  found  that  DRGs  explained  more  variation  in  length  of  stay  than  any  of 
tne  comparison  methods.  DRGs  were  only  slightly  better  than  the  three  digit 
diagnosis  codes,  but  required  about  one-half  the  number  of  groups  in  which  to 
categorize  the  patient  population  as  do  the  diagnosis  codes.  It  is  much  easier 
for  a  manager  to  monitor  467  DRG  groups  rather  than  groups  based  on  over  1,000 
three  digit  codes. 

We  were  concerned,  however,  with  what  was  thought  to  be  a  relatively  low 
proportion  of  explained  variance  in  length  of  stay  using  DRGs.  To  increase  the 
explained  variance,  patients  with  unusually  long  or  short  lengths  of  stay  were 
removed  from  the  data.  The  results  are  summarized  in  Table  1. 

Each  method  of  removing  outliers  has  helped  to  explain  more  variance  in 
length  of  stay,  especially  the  Yale  trim  points  technique.  The  Yale  trim  points 
include  cut  points  for  both  abnormally  short  and  long  lengths  of  stay,  whereas 
the  HCFA  cut  points  are  for  abnormally  long  lengths  of  stay  only.  Note  that 
using  the  long  length  of  stay  reduced  the  effect  of  outlier  cases  for  untrimmed 
data. 

Next,  the  individual  DRGs  were  reviewed  to  ascertain  which  ones  possessed 
’arge  amounts  of  unexplained  variance.  The  statistic  used  to  identify  these 
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Table  1 

EXPLAINED  VARIATION  IN  LENGTH  OF  STAY 
BY  SELECTED  RECORDS,  1982 


Percent  Variation 
Explained 

Selection  Criteria 

Number 

of 

Records 

Length 

of 

Stay 

Log 

Length 
of  Stay 

All  Records 

196,916 

23 

29 

Exclude  Transfer 
(in  or  out) 

185,704 

26 

32 

HCFA  Trim  Points 

191,131 

31 

31 

Yale  Trim  Points 

183,115 

41 

40 

DRGs  was  the  coefficient  of  variation,  which  is  simply  the  ratio  of  the  standard 
deviation  of  the  length  of  stay,  divided  by  the  mean  length  of  stay,  and 
expressed  as  a  percentage.  The  ,ower  this  ratio,  the  lower  the  amount  of 
variation  existing  in  tie  sample  being  studied  and  the  more  homogeneous  the  DRG 
grouping.  A  coefficient  of  variation  of  70%  was  arbitrarily  chosen  as  a  cutoff 
point  to  flag  potential  problem  DRGs. 

The  coefficient  of  variation  tended  to  be  large  across  a  majority  of  the 
DRGs.  Table  2  displays  only  the  results  for  the  first  ten  DRGs,  but  these 
results  are  typical  of  what  was  found  for  most  of  the  case  mix  groupings.  We 
initially  looked  at  the  obvious  reasons  for  this  variability,  again  removing 
transfers  and  using  log  length  of  stay,  but  the  results  indicated  that  most  of 
the  DRGs  were  candidates  for  further  analysis. 

From  the  first  phase  of  this  study  we  concluded  that  DRGs  were  better  tha 
other  patient  grouping  methods.  The  fact  that  they  accounted  for  only  about  . 
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Table  2 

COEFFICIENT  OF  VARIATION  FOR  DRGs  1-10 


DRG 

Number 

Description 

Number 

of 

Cases 

Coefficient 

of 

Variation 

1 

Craniotomy  age  >  18 
except  for  trauma 

112 

86.3 

2 

Craniotomy  for  trauma 
age  >  18 

23 

77.5 

3 

Craniotomy  age  <  18 

34 

91.9 

4 

Spinal  procedures 

141 

118.8 

5 

Extracranial  vascular 
procedures 

140 

125.1 

6 

Carpal  tunnel  release 

0 

- 

7 

Periph  &  cranial  nerve  & 
other  nerv  syst  proc 
age  >  70  and/or  C.C. 

36 

155.1 

8 

Periph  &  cranial  nerve  & 
other  nerv  syst  proc 
age  <  70  w/o  C.C. 

457 

194.6 

9 

Spinal  disorders  & 
injuries 

53 

158.7 

10 

Nervous  System  neoplasms 
age  >  70  and/or  C.C. 

42 

90.7 

quarter  of  the  total  variance  in  length  of  stay  suggested  that  there  must  be 
other  variables  to  explore.  To  identify  these  other  factors,  our  research  has 
progressed  in  two  directions.  First,  variables  that  were  already  available  on 
the  Inpatient  Data  System  were  examined  to  see  if  they  would  explain  additional 
variance  in  length  of  stay.  These  included  patient  related  variables  (age, 
race,  sex,  number  of  medical  diagnoses  and  procedures,  admission  and  discharge 
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status),  hospital  characteristics  (size,  location,  and  teaching  status),  and 
military  unique  variables  (military  status,  length  of  service,  and  pre-existence 
of  condition  prior  to  entry).  Second,  a  pilot  test  was  conducted  to  determine 
whether  differences  in  length  of  stay  within  DRGs  could  be  accounted  for  by 
disease  severity,  a  concept  which  was  operationalized  by  using  patient  acuity 
levels.  Patient  acuity  was  measured  by  the  patient  classification  tool  which  is 
part  of  the  Nursing  Workload  Management  System  discussed  in  the  presentation  by 
COR  Rieder. 

The  procedure  adopted  to  study  the  effect  of  variables  from  the  Inpatient 
Data  System  was  similar  to  that  used  by  the  Yale  researchers  who  developed  the 
original  DRGs.  Each  DRG  was  analyzed  to  see  if  any  of  the  selected  variables 
from  the  inpatient  record  could  explain  additional  variance  in  length  of  stay. 
Therefore,  an  attempt  was  made  to  further  subdivide  each  DRG. 

Each  DRG  was  split  into  subgroups  based  upon  the  results  of  a  stepwise 
multiple  regression  analysis  into  which  the  inpatient  variables  of  interest  were 
entered.  The  criteria  used  for  terminating  the  splitting  process  was  as 
fol lows : 

(1)  statistical  significance  of  the  split  was  not  attained  using 
p  <  .05; 

(2)  less  than  one  percent  of  additional  variance  in  length  of  stay 
was  explained  by  further  split; 

(3)  the  group  to  be  split  contained  fewer  than  100  cases; 

(4)  splitting  would  result  in  a  subgroup  with  fewer  than  10  cases; 

(5)  the  difference  in  the  average  length  of  stay  for  split 
groups  was  less  than  one-half  of  a  day;  and, 

(6)  splitting  according  to  statistical  criteria  resulted  in  uninter¬ 
pretable  subgroups  (e.g.,  age  groups  or  number  of  procedures). 

This  procedure  is  best  illustrated  with  an  example. 

Table  3  shows  the  splitting  process  for  one  selected  DRG.  The  total  samp’e 
was  first  subdivided  on  the  basis  of  active  duty  enlisted  status.  Active  dutj 

*•  .  .  *.  '  .  _  '  .  •  \*N  ‘ 


SUBGROUPS  FOR  DRG  352  USING  1980  DATA 


enlisted  patients  stayed  about  two  days  longer  than  other  patients.  Second,  the 
DRG  was  further  split  on  the  basis  of  whether  the  hospital  was  a  teaching  insti¬ 
tution  or  not.  In  this  case,  there  was  one  day's  difference  in  length  of  stay 
between  the  two  groups  of  hospitals,  with  the  teaching  hospitals  having  the 
shorter  stay.  This  finding  was  somewhat  unusual  since  teaching  hospitals  tend 
to  have  longer  lengths  of  stay  than  non-teaching  hospitals. 
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Using  1980  data,  this  subgrouping  procedure  was  repeated  for  each  DRG 
with  over  100  cases.  To  check  the  consistency  of  the  results,  we  tested  the 
significance  of  each  split  found  for  1980  data  by  using  1982  data.  We 
reasoned  that  the  most  important  factors  would  be  significant  for  more  than 
one  time  period.  Again,  using  the  results  for  DRG  352  as  an  example,  we  found 
that  1982  average  length  of  stay  data  for  enlisted  personnel  was  signif icantly 
greater  than  for  the  other  patients  (See  Table  4).  However,  there  was  no  dif- 
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SUBGROUPS  FOR  DRG  352  USING  1982  DATA 


DRG  352 

Other  male  reproductive 
system  diagnoses 


Other  than  active 
duty  enlisted 


Active  duty  enlisted 


X  =  2.0 
S  =  1.7 
N  =  106 


X  =  4.4 
S  =  6.6 
N  =  197 


Non-teaching 

Hospital 


Teaching 

Hospital 


X  =  2.0 
S  =  1.9 
N  =  42 


X  =  1.9 
S  =  1.5 
N  =  64 


*  not  significant 


ference  found  between  teaching  and  non-teaching  hospitals  in  this  specific 
ORG.  Therefore,  only  the  subgroups  based  on  enlisted  status  were  retained  in 
this  DRG.  The  results  of  this  splitting  process  for  all  the  DRGs  are  sum¬ 
marized  in  Table  5. 
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Table  5 

VALIDATION  OF  1980  DRG  SUBGROUPINGS 
USING  1982  DATA 


1980 

DRGs  (out  of  a  possible  467) 

456 

Additional  groupings 

674 

Total 

1,130 

1982 

DRGs  (out  of  a  possible  467) 

456 

1980  subgroups  significant 

345* 

during  1982 

Total 

801 

*  332  significant  at  <.05;  13  signifiant  at  <.075 
and  means,  s.d.s  were  quite  different. 


Of  the  674  additional  groupings  found  for  1980,  332  were  also  significant 
(p  <  .05)  for  1982.  An  additional  13  groups  had  probability  levels  of  .06  or 
.07.  Since  the  means  of  these  subgroups  were  quite  different,  they  were 
retained.  Two-thirds  of  the  additional  splits  retained  in  1982  were  accounted 
for  by  the  five  factors  listed  in  Table  6. 

The  primary  factors  were  number  of  diagnoses  and  number  of  procedures. 
These  two  factors  are  often  used  as  proxies  for  disease  severity  which  is  less 
than  ideal  because  of  the  potential  for  abuse.  If  a  hospital's  performance 
rating  or  funds  are  dependent  upon  the  number  of  coded  diagnoses  or  procedures, 
there  may  be  an  incentive  to  document  extra  codes  in  a  patient's  record. 
Concomitantly,  the  longer  a  patient  stays  in  the  hospital,  the  more  likely  that 
additional  medical  problems  will  be  identified  which  warrant  additional  proce¬ 
dures.  As  might  be  expected,  transfer  status  was  also  a  significant  factor; 
patients  transferred  from  one  facility  to  another  tend  to  stay  longer  than 
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Table  6 

VARIABLES  USED  TO  FORM  0R6  SUBGROUPS  19 82 


Variable 

Number 

of 

Subgroups 

Percent 

Number  of  diagnoses 

65 

18.8 

Number  of  procedures 

62 

18.0 

Admitted  by  transfer 

39 

11.3 

Active  duty  enlisted  status 

37 

10.7 

Large  teaching  hospital 

31 

9.0 

Other 

111 

32.2 

Total 

345 

100.0 

direct  admissions.  A  military  unique  factor  that  was  a  significant  variable  was 
active  duty  enlisted  status;  enlisted  patients  tend  to  stay  one  or  two  days 
longer  than  other  patients.  Another  important  variable  was  the  characteristics 
of  the  hospital  at  which  the  patient  was  treated.  Patients  treated  at  large 
teaching  hospitals  tend  to  have  a  longer  length  of  stay  than  patients  treated  at 
other  hospitals. 

Overall,  the  additional  variation  explained  by  these  extra  subgroups  is 
not  dramatic.  The  345  additional  subgroups  increased  the  variance  explained 
from  about  23%  to  about  30%.  These  additional  subgroups,  however,  can  be  useful 
in  at  least  two  ways.  One  option  is  to  modify  the  DRGs  for  military  use  by 
adding  these  additional  subgroups.  If  this  option  is  selected  some  labeling 
convention  would  be  necessary  so  that  the  data  can  be  collapsed  back  to  the 
original  ORG  level.  This  would  be  essential  for  comparing  Navy  data  with  that 
from  civilian  hospitals  or  from  the  other  military  services.  Another  alter¬ 
native  is  to  simply  use  the  DRGs  as  they  have  been  defined  by  their  developers. 


but  when  unexpected  differences  among  specific  hospitals  emerge  a  manager  would 
refer  to  the  subgroups  for  possible  explanation. 

As  stated  earlier,  this  further  subgrouping  of  DRGs  was  one  of  two  attempts 
to  account  for  additional  variance  in  length  of  stay.  A  more  recent  effort  has 
been  to  explore  the  possibility  of  using  a  nursing  patient  classification  tool 
as  an  estimate  of  disease  severity.  Our  logic  was  that  the  more  seriously  ill 
the  patient,  the  more  nursing  care  that  patient  would  require.  Since  Dr.  Susan 
Horn  has  demonstrated  that  disease  severity  is  a  major  contributor  to  patient 
length  of  stay,  we  decided  to  see  if  this  relationship  with  length  of  stay  would 
hold  for  required  nursing  care.  Since  patient  classification  data  is  already 
being  collected  for  staffing  purposes,  this  data  was  readily  available,  and 
additional  coders  to  review  patient  charts  were  not  needed. 

The  hospitals  included  in  this  pilot  study  were  selected  from  a  subgroup  of 
those  participating  in  the  Nursing  Workload  Management  System  field  study.  Data 
from  each  selected  hospital  were  obtained  for  September  through  November  1983. 
This  was  the  earliest  time  period  for  which  we  had  both  patient  classification 
data  and  matching  inpatient  data.  The  DRGs  studied  (See  Table  7)  were  selected 
from  among  those  DRGs  which  had  relatively  large  amounts  of  variability  in 
length  of  stay. 

The  matching  of  patient  class  data  with  inpatient  data  was  done  manually, 
using  admission  and  discharge  dates  and  each  patient's  last  name.  So  far,  we 
have  only  matched  about  425  records  and  have  just  begun  data  analysis.  Our  pre¬ 
liminary  findings  have  not  been  what  was  expected.  We  had  anticipated  that 
there  would  be  a  strong  correlation  between  the  average  patient  classification 
and  the  patient  length  of  stay,  but  so  far  this  has  not  been  found. 

The  type  of  problem  we  encountered  can  be  illustrated  with  an  example 
using  DRG  243,  Medical  Back  Problems.  It  was  expected  that  patient  classifi¬ 
cation  data  would  be  useful  for  differentiating  between  those  patients  just 
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Toxic  effects  of  drugs,  age  18-69, 
without  complications/comorbidity 


receiving  a  few  tests  and  requiring  little  care  versus  seriously  ill,  immobi¬ 
lized  patients,  requiring  significant  care.  The  assumption  was  that  the 
seriously  ill  patient  would  both  require  more  nursing  care  and  have  a  longer 
length  of  stay.  For  our  very  small  sample,  this  relationship  did  not 
materialize.  Patients  with  low  average  patient  classifications  actually 
tended  to  stay  somewhat  longer  than  patients  with  higher  average  classifica¬ 
tions.  (See  Table  8). 

Table  8 

AVERAGE  LENGTH  OF  STAY  BY  AVERAGE  NURSE  PATIENT  CATEGORY, 

SAMPLE  OF  RECORDS  FROM  SEPTEMBER  THROUGH  NOVEMBER  1983 

ORG  243  (Medical  Back  Problems) 


Average 

Nurse 

Patient 

Class 

Number 

of 

Cases 

Average 
Length 
of  Stay 

1  -  1.5 

73 

6.8 

1.51  -  2.5 

61 

5.7 

2.51  -  3.5 

1 

14.0 

Total  Sample 

135 

6.4 

At  this  point,  we  can  only  speculate  on  why  this  has  occurred.  One 
possible  explanation  is  that  nursing  care  hour  requirements  for  seriously  ill 
patients  with  long  lengths  of  stay  tend  to  be  concentrated  during  the  first 
few  days  of  hospitalization.  Following  this  short  period  of  high  intensity 
nursing  care,  these  patients  require  a  lower  level  of  nursing  care  during  the 
remainder  of  their  stay.  This  lower  level  of  nursing  care  required  during  the 
latter  part  of  the  patient  stay  deflates  the  value  of  the  average  patient 


classification  for  these  patients.  For  such  patients,  the  average  classifica¬ 
tion  may  be  an  inadequate  measure  to  express  the  level  of  care  required.  A  more 
promising  approach  may  be  to  analyze  the  patient  classification  data  using  only 
the  first  few  days  of  admission,  or  to  use  as  a  dependent  variable  the  maximum 
class  at  which  a  patient  was  rated.  We  also  plan  to  look  at  the  nursing  care 
time  required  by  the  patient  rather  than  just  the  patient  classification  cate¬ 
gory.  A  typical  problem  with  using  patient  classification  data  to  account  for 
length  of  stay  can  be  illustrated  by  comparing  data  from  two  patients  in  DRG 
243.  Patient  A  stayed  10  days  and  was  rated  as  a  Class  I  for  every  day 
following  the  day  of  admission.  Patient  B  stayed  two  days  and  was  a  Class  II 
for  both  days.  There  is  no  positive  correlation  in  this  example  between  patient 
class  and  length  of  stay.  Unfortunately,  such  examples  are  fairly  easy  to  find. 
To  see  if  the  differences  in  length  of  stay  for  these  two  patients  could  be 
explained  by  factors  other  than  severity,  all  available  data  from  the  Navy 
Inpatient  Data  System  was  examined.  The  findings  revealed  nothing  obvious. 

Both  patients  were  assigned  to  the  same  DRG  and  had  the  following  charac¬ 
teristics: 

a.  both  were  direct  admissions  to  the  same  hospital; 

b.  both  were  male  enlisted  personnel  - 

Patient  A  is  an  E4,  age  26, 

Patient  B  is  an  E7,  age  31; 

c.  both  had  only  one  diagnosis  code,  and  the  code  was  very  similar  - 

Patient  A  (Diagnosis  Code  7222) 

Patient  B  (Diagnosis  Code  7221);  and, 

d.  neither  had  any  major  or  minor  procedures  listed  on  the  record. 

At  this  point,  the  easiest  way  to  identify  significant  factors  not  accounted 
for  by  DRGs  may  be  to  pinpoint  such  patients  and  then  examine  their  medical 


records . 
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Since  the  Army  researchers  plan  to  use  patient  classification  data  in  a 
similar  way,  we  will  be  very  interested  in  their  findings.  It  should  be  empha¬ 
sized  that  we  have  not  addressed  the  importance  of  patient  classification  data 
for  other  purposes,  especially  in  accounting  for  differences  in  hospital  costs. 
Since  nursing  care  accounts  for  50*  of  all  hospital  costs,  this  variable  must  be 
explored  further. 

Based  upon  the  New  Jersey  results  we  thought  that  DRGs  were  not  as  useful 
for  explaining  length  of  stay  differences  between  Navy  hospitals  as  they 
appeared  to  be  for  the  civilian  community.  DRGs  explained  less  than  25%  of  the 
variation  in  Navy  data,  whereas  New  Jersey  results  showed  over  40%  variance 
explained.  A  recent  conversation  with  Dr.  Susan  Horn  indicated  that  the  Navy 
finding  may  actually  be  in  line  with  those  of  most  civilian  hospitals.  Dr.  Horn 
stated  that  the  civilian  hospitals  in  which  she  has  conducted  research  have 
explained  between  25-30%  of  the  variance  in  length  of  stay  using  DRGs.  In  her 
opinion,  by  modifying  DRGs  to  account  for  enlisted  personnel  and  patient 
transfers  (factors  which  tend  to  be  more)  important  for  the  Navy  than  for  most 
civilian  hospitals),  we  could  equal  or  exceed  the  variance  explained  by  DRGs  in 
most  civilian  hospitals. 

We  plan  to  continue  our  search  for  important  factors  contributing  to  length 
of  stay,  especially  those  that  may  be  unique  to  the  military.  After  the  Army 
and  the  Air  Force  have  had  the  opportunity  to  address  the  issue  of  additional 
DRG  subgroups,  it  would  seem  wise  for  the  three  Services  to  scrutinize  each  DRG 
to  see  if  agreement  on  specific  subgroups  can  be  achieved.  The  identification 
of  and  agreement  on  a  standardized  unit  of  measure  is  an  essential  step  towards 
monitoring  and  comparing  hospital  productivity. 


CASE  MIX  AND  DATA  QUALITY 
MAJ  Lawrence  M.  Leahy,  MSC,  USA 

INTRODUCTION 

When  a  military  hospital  discovers  that  its  planned  budget  exceeds  allowed 
budget  guidance,  the  hospital  commander  may  argue  that  the  hospital  is  entitled 
to  budget  adjustment  because  of  the  type  of  patients  treated  and  the  services 
provided.  This  arguement,  which  has  been  supported  by  econometric  and  hospital 
cost  studies  conducted  in  the  late  60' s  and  early  70’ s,  has  recently  begun  to 
gain  acceptance  by  higher  headquarters.  The  motivation  for  this  attention 
derives  from  a  number  of  sources:  decrease  in  medical  workload,  increase  in 
technology,  and  dissatisfaction  with  the  Medical  Care  Composite  Unit  (MCCU). 

The  Army  Surgeon  General  and  the  Commander,  US  Army  Health  Services  Command 
(HSC),  are  considering  development  of  an  expanded  case  mix  approach  to  measure 
productivity  in  the  Army  health  care  system  because  of  dissatisfaction  with  the 
MCCU.  HSC  is  evaluating  the  use  of  Diagnosis  Related  Groups  (DRGs)  as  part  of 
this  approach.  While  a  DRG  system  has  the  potential  to  benefit  the  military 
health  care  sector,  there  is  a  need  to  evaluate  the  accuracy  of  the  medical 
record  system  which  will  be  used  to  develop  the  use  of  DRGs  or  any  other  case 
mix  method. 


REL1ABILIY  OF  DATA 

Independent  support  for  the  need  to  evaluate  the  reliability  of  the  input 
data  to  case  mix  comes  from  many  sources. ^  However,  the  initial  impetus  came 
from  studies  conducted  by  the  Institute  of  Medicine  (IOM),  which  assessed  the 
reliability  of  information  abstracted  from  patients'  medical  records. ?  While 
tee  studies  were  conducted  separately,  their  major  objective  was  the  same--to 
determine  the  reliability  of  selected  informational  items.  Items  assessed 
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date  of  hospital  admission,  date  of  discharge,  sex,  date  of  birth  or  age,  source 
of  payment,  principal  diagnosis,  admitting  diagnosis,  other  diagnoses,  principal 
and  other  procedures,  race,  marital  status,  and  disposition. 3, 4  Although  pre¬ 
vious  studies  have  been  conducted  on  quality  of  medical  records,  the  IOM  studies 
are  unique  for  the  following  reasons: 

1.  They  were  national  in  scope. 

2.  They  involved  an  independent  examination  of  the  medical  record  by 
someone  other  than  the  individual  initially  completing  the 
abstract . 

3.  Conclusions  were  derived  from  a  thorough  review  of  the  medical 
records . 5 

The  first  study  "concentrated  on  data  derived  from  hospital  discharge 
abstracts  processed  by  private  abstract  services"  which  covered  about  65  percent 
of  all  discharges  from  short-stay  general  hospitals.  Based  on  the  sampling  plan 
and  a  weighted  analysis,  data  were  generalized  nationally  to  all  1974  discharges 
for  Medicare  and  Medicaid  patients  who  were  treated  in  hospitals  subscribing  to 
the  participating  abstract  services  or  the  larger  hospitals  with  internal  data 
systems.  The  second  study  evaluated  the  data  from  Medicare  claims  submitted  by 
hospitals  to  fiscal  intermediaries  and  eventually  to  the  Health  Care  Financing 
Administration  (HCFA).  The  Medicare  study  results  were  applied  to  "all  Medicare 
beneficiaries  age  65  and  over,  who  were  discharged  from  hospitals  during  1974. "6 
The  final  IOM  study  focused  on  data  collected  by  the  National  Hospital  Discharge 
Survey  (NHDS),  a  voluntary  survey  that  yielded  statistics  on  utilization  of  all 
general  and  special  non-federal,  short-stay  hospitals  in  the  United  States.'7 
Because  the  methods  of  the  NHDS  study  were  similar  to  the  previous  two  IOM 
studies,  the  results  could  be  applicable  to  all  NHDS  data  collected  in  1977. 

In  all  three  studies  the  non-medical  data  reliability  was  of  a  high  level. 
However,  reliability  of  diagnositic  information  was  questioned  by  all  three 
studies  (Tables  1  and  2)  and  elicited  "serious  reservations  about  the  adequacy 


207 


Table  1 


DISCREPANCIES  BETWEEN  THE  IOM  FIELD  TEAM  AND  ORIGINAL  DATA  SOURCE 
(PRIVATE  ABSTRACT  OR  MEDICARE  RECORD)  FOR  SELECTED  DATA  ITEMS 


Data  Item 

Weighted  Percent 
Medicare  Record 

With  No  Discrepancy 
Private  Abstract 

Admission  Date 

99.5% 

99.7% 

Discharge  Date 

99.3 

99.2 

Date  of  Birth/Age 

- 

- 

Sex 

99.4 

99.1 

Payment  Source 

- 

98.1 

Principal  Diagnoses  (4) 

57.2 

65.2 

Additional  Diagnoses 

74.5 

- 

Principal  Procedures 

78.9 

73.2 

Source:  L.  K.  Demlo,  P.  M.  Campbell,  and  S.  S.  Brown,  "Reliability  of 

Information  Abstracted  from  Patients'  Medical  Records"  Medical  Care 
16  (December  1978)  p.  999,  Table  1. 


Table  2 

ADEQUACY  OF  THE  FACE  SHEET  FOR  ABSTRACTING  NHDS 
DATA  (WEIGHTED  PERCENT) 


Data  Item 

Adequate 

Insufficient 

Inaccurate 

Total 

Admission  Date 

98.8% 

0.8% 

0.4% 

100.0% 

Discharge  Date 

99.2 

0.6 

0.2 

100.0 

Date  of  Birth/Age 

96.9 

0.8 

2.3 

100.0 

Sex 

97.4 

1.0 

1.6 

100.0 

Race 

91.7 

1.0 

7.3 

100.0 

Marital  Status 

95.8 

0.2 

4.0 

100.0 

Principal  Expected  Source  Payment 

97.7 

0.4 

1.9 

100.0 

Additional  Expected  Source  Payment 

95.9 

0.1 

4.0 

100.0 

Patient  Disposition 

24.8 

2.0 

63.2 

100.0 

Principal  Diagnosis 

47.3 

49.3 

3.4 

100.0 

Principal  Procedure 

5.9 

90.6 

3.0 

100.0 

Source:  L.K.  Demlo  and  P.  M.  Campbell,  "Improving  Hospital  Discharge  Data: 

Lessons  From  the  National  Hospital  Discharge  Survey"  ^edical_Care 
19  (October  1981)  p.  1037,  Table  8. 
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of  existing  hospital  discharge  information" .8.9  Because  these  three  studies 
were  conducted  during  the  time  period  when  four-digit  coding  of  diagnoses  was 
standard,  the  evaluators  believed  there  would  be  more  concern  about  the  relia¬ 
bility  of  hospital  discharge  data  when  a  five-digit  coding  scheme  was 

utilized. 10 

Since  the  10M  studies  of  the  reliability  of  discharge  data  found  high  error 
rates,  additional  studies  were  conducted  by  Richard  F.  Corn,  Cynthia  Barnard, 
and  Truman  Esmond,  and  H.  P.  Doremus  and  Elana  M.  Michenzi.  Corn's  research  was 
sponsored  by  the  National  Center  for  Health  Statistics  and  was  an  assessment  of 
the  state  of  quality  control  procedures  utilized  by  abstracting  services.  Using 
a  combination  of  letters,  telephone  calls  and  personal  interviews,  information 
was  gathered  on  quality  control  procedures  reportedly  followed  by  private 
abstracting  services,  HUS,  and  HCFA  in  its  20  percent  Medicare  sample.  In 
studying  procedures  utilized  in  preparation  and  processing  of  either  an  abstract 
or  claim  form.  Corn's  study  team  raised  four  significant  points: 

1.  Each  of  the  three  major  national  sources  of  hospital  discharge  data... 
need  improvement,  particularly  in  the  verification  of  abstracted  infor¬ 
mation,  error  correction,  and  training  programs. 

2.  Validity  of  the  (HOS)  data  may  be  limited  because  the  abstractor  is 
instructed  to  refer  only  to  the  face  sheet  of  the  medical  record. 

3.  Quality  control  procedures  of  the  Medicare  system  are  limited  and  vary 
across  the  country. 

4.  Steps  should  be  taken  to  improve  the  quality  of  discharge  data  in  view 
of  the  importance  of  accurate  data.*! 

The  study  conducted  by  Barnard  and  Esmond  had  three  areas  of  focus:  the 
ambiguity  inherent  in  use  of  diagnosis  and  procedure  coding  schemes  and  their 
appl icabi 1 ity;  the  source  of  Medicare  bill  data  and  its  relevance  to  DRG  assign¬ 
ment;  financial  and  case  mix  implications  of  discrepancies  between  billing  data 
and  medical  records  data.  Using  a  random  50  percent  sample  of  Medicare  inpa¬ 
tients  discharged  from  Rush-Presbyter i an-St .  Luke's  Medical  Center  during  the 
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year  beginning  1  May  1979,  Barnard  and  Esmond  compared  concurrently  determines 
discharge  data  with  retrospectively  determined  discharge  data.^  In  comparing 
the  two  types  of  discharge  data,  the  study  showed  that  "in  53  percent  of  the 
cases... the  retrospecti vely  coded  diagnosis  had  not  been  cross-coded  at  all  on  a 
concurrent  basis. (Table  3)  When  applying  this  coding  difference  to  the  ukg 
payment  process,  the  study  showed  that  "reimbursement  based  on  concurrent  data, 
with  case  mix  and  local  wage  index  as  the  sole  determinant  of  payment  amount, 
would  have  averaged  $600  less  than  reimbursement  based  on  retrospective  data."^ 
(Table  4)  In  analyzing  their  research  data,  Barnard  and  tsmond  were  quite 
emphatic  in  the  use  of  concurrent  data  as  a  case  mix  data  base.  Specifically, 
they  observe,  "the  data  upon  which  the  Health  care  Financing  Administration 
plans  to  construct  its  new  reimbursement  mechanism  may  be  inappropriate  for  that 
use... the  current  case  mix  based  on  Diagnosis  Related  Groups  cannot  be  used  to 
measure  resource  use  and  therefore  will  not  accurately  predict  reimbursement 
needs . 

Additional  support  for-  Barnard  and  Esmond's  hypothesis  concerning  t fie 
reliability  of  the  MCI  A  data  base  has  been  provided  by  Doremus  and  Michen/i  who 
compared  data  from  the  MEDPAR  File,  the  original  medical  record  discharge  ui dei , 
and  a  reabstracted  record.  Based  on  the  authors'  comparison,  an  analysis  was 
made  of  each  item's  effect  upon  OKI!  cl  ass  if  leal  ion  and  the  Medicare  reimburse 
ment  ceiling  for  University  Hospitals  of  Cleveland.  Study  results  show: 

1.  In  47.7  percent  of  the  cases  studied  tin-  principal  diagnosis  rode  was 
different  in  the  HCFA  data  base  than  in  the  patient's  original  medi.al 
record  discharge  order  . 

2.  A  comparison  of  the  principal  diagnosis  code  on  the  original  di  •.<  harge 
order  with  the  code  on  the  reahstr acted  record  revealed  a  different  rode 
in  3?.l  percent  of  the  cases  studied.  (Table  ‘> ) 

1  he  variation  in  diagnostir  and  surgical  information  between  Un-  IK. I  n 
data  base  and  information  found  m  the  original  disi harge  or der  resulted 
in  a  different  URG  r lassif icat  ion  for  61.1  percent  of  the  patients  in 
the  study. 


3. 
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Table  3 

MATCHING  RETROSPECTIVE  PRINCIPAL  AND  SECONDARY  CODES  TO 

CONCURRENT  CODES 


Principal 
Di agnosis 

Secondary 

Diagnosis 

Principal 

Procedure 

Secondary 

Procedure 

Retrospective  code 
matches  concurrent  (%) 

35.05 

4.77 

31.32 

5.34 

Retrospective  principal  as 
concurrent  secondary  or 

6.38 

6.45 

4.43 

6.96 

vice  versa  (%) 

2.82 

2.41 

.54 

.40 

No  retrospective 

code  found  (%) 

2.62 

41.60 

15.83 

40.19 

Retrospective  code 

not  in  concurrent  (%) 

53.12 

44.76 

47.88 

47.11 

Total 

100.00 

100.00 

100.00 

100.00 

NOTE:  Rounded  numbers  may  not  add  to  100% 

Source:  C.  Barnard  and  T.  Esmond.  "ORG-Based  Reimbursement :  The  Use  of 
Concurrent  and  Retrospective  Clinical  Data"  Medical  Care  19: 
(November  1981),  p.  1077,  Table  2. 


Table  4 

DISCREPANCIES  IN  RETROSPECTIVE  VERSUS 
CONCURRENT  DRGs 


No.  of 
Cases 

% 

Retrospective  DRG 
same  as  concurrent 

684 

22.984 

Retrospective 
differs  from 
concurrent 

DRG 

2292 

77.016 

Source:  C.  Barnard  and  T.  Esmond,  "DRG-Based  Reimbursement:  The  Use  of 
Concurrent  and  Retrospective  Clinical  Data"  Medical  Care  19 
(November  1981):  1078,  Table  3. 


Table  5 


DIAGNOSTIC  DATA  DISCREPANCIES  BETWEEN  THE  HEALTH  CARE  FINANCE 
ADMINISTRATION  (HCFA)  RECORD,  ORIGINAL  DISCHARGE  ORDER, 

AND  REABSTRACTED  RECORD 


HCFA  Record 
Compared  With 
Original  Discharge 
Order 

Original  Discharge 
Order  Compared 

With  Reabstracted 
Record 

Data  Item 

Number 

% 

Number 

% 

Disagreement  on  principal 
diagnositc  code  (all  digits 
-ICUA-8) 

125 

47.7 

84 

32.1 

Case  with  indication  of  one 
additional  diagnosis  on  at 
least  one  record 

159 

61.0 

201 

76.7 

Disagreement  on  first  listed 
additional  diagnosis  when 
both  records  show  an  addi¬ 
tional  diagnosis/diagnoses 

0 

0 

77 

38.3 

Cases  with  indication  of  addi¬ 
tional  diagnosis  on  only  one 
record 

159 

100.0 

54 

26.9 

Cases  with  no  indication  of 
additional  diagnosis  on 
either  record 

103 

39.3 

61 

23.3 

Source:  H.  D.  Doremus  and  E.  M.  Michenzi,  "Data  Quality  :  An  Illustra¬ 
tion  of  its  Potential  Impact  Upon  a  Diagnosis  Related  Group's 
Case  Mix  Index  and  Reimbursement"  Medical  Care  (October  1983): 
p.  100,  Table  1. 


4.  In  37  percent  of  the  cases  studied  the  DRG  classification  differed  when 
classification  was  compared  based  on  diagnostic  and  surgical  information 
from  the  original  discharge  order  and  the  reabstracted  record. 

5.  Using  data  from  the  HCFA  data  base  for  case  mix  reimbursement  would  lead 
to  a  significantly  understated  level  of  Medicare  reimbursement. 


The  results  of  the  study  demonstrated  that  there  is  inaccurate,  incomplete 
recording  of  diagnostic  and  surgical  information  in  the  medical  record,  which 
reinforced  the  findings  of  Barnard  and  Esmond's  study  and  brought  out  the 
requirement  for  additional  research  on  data  quality.^ 

THE  OARNALL  ARMY  COMMUNITY  HOSPITAL  STUDY 

The  foregoing  studies  tend  to  suggest  that  data  on  discharge  abstracts  may 
be  inappropirate  for  use  in  developing  a  case  mix  system  and  that  further 
research  on  data  quality  is  necessary  prior  to  implementing  any  case  mix  system. 
A  study  on  data  quality  has  recently  been  initiated  at  Darnall  Army  Community 
Hospital  (DACH),  which  is  a  250-bed  academic  community  hospital  located  at  Fort 
Hood,  Texas. 

Objectives  of  the  Study 

The  study  is  comparing  concurrently  determined  diagnostic  data  with  medical 
records  retrospectively  determined  diagnostic  data.  The  goal  is  to  evaluate  the 
extent  to  which  concurrently  collected  data  can  be  substituted  for  retrospective 
diagnosis.  Because  the  study  is  being  used  as  a  Graduate  Research  Project,  it 
is  also  determining  reasons  for  discrepancies  and  determining  specific  recommen¬ 
dations  to  decrease  the  discrepancy  rate. 

Data  and  Method 

Data  were  obtained  from  DACH 1 s  Patient  Administration  Division  for  patient 
dispositions  during  January  1984.  The  data  elements  obtained  were  patient 
number,  principal  diagnosis  (concurrent  and  retrospective),  additional  diagno¬ 
ses,  sex,  principal  service  utilized,  and  beneficiary  category.  To  accurately 
assess  the  difference  between  concurrent  and  retrospective  diagnostic  data,  all 
patients  admitted  for  delivery,  newborns,  absent  sick,  carded  for  record,  ar.  i 
medical  board  patients  were  eliminated  from  the  study. 
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Study  Results 

Administrative  information  (sex,  principal  service,  and  beneficiary  cate¬ 
gory)  was  highly  reliable  in  our  study.  However,  there  was  much  less  reliabil¬ 
ity  for  cl  in 'cal  data.  The  principal  retrospective  diagnosis  was  coded  prin¬ 
cipal  in  concurrent  diagnosis  in  72  percent  of  the  cases.  In  20  percent  of  the 
cases  the  concurrent  code  was  matched  by  an  additional  retrospective  diagnosis. 
Thus,  the  retrospective  coded  diagnosis  had  not  been  coded  on  a  concurrent  basis 
in  eight  percent  of  the  cases.  (Table  6) 

Table  6 

MATCHING  RETROSPECTIVE  PRINCIPAL  AND  ADDITIONAL 
DIAGNOSIS  TO  CONCURRENT  DIAGNOSIS 


Retrospective  diagnosis  matches 
concurrent  diagnosis  (%) 

72.0 

Retrospective  additional  diagnosis 
matches  concurrent  diagnosis  (X) 

20.0 

Retrospective  diagnosis  does  not 
match  concurrent  diagnosis  (X) 

8.0 

Total 

100.0 

Tentative  Conclusions 

While  our  preliminary  study  results  show  a  discrepancy  rate  less  than  that 
found  in  other  studies  previously  referenced,  there  is  a  sufficient  problem 
indication  which  questions  the  reliability  of  patient  data.  It  is  difficult  to 
make  a  general  statement  about  the  quality  of  medical  records  data  throughout 
Health  Services  Command  because  our  data  is  preliminary  and  our  patient  popula¬ 
tion  is  quite  unique.  However,  prior  to  constructing  a  new  case  mix  budgetary 
system,  there  is  a  need  for  additional  research.  We  are  continuing  the  current 
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INPATIENT  PERFORMANCE  MEASUREMENT  UPDATE: 

DATA  CONVERSION  AND  INITIAL  CASE  MIX  ACCOUNTING 

Velda  Austin 

MAJ  Stuart  W.  Baker,  MSC,  USA 
INTRODUCTION 

The  Patient  Administration  Systems  and  Biostatistics  Activity  ( PAS&BA )  has 
been  performing  hospital  profile  analyses  in  one  form  or  another  for  many  years. 
The  analyses  have  been  performed  using  the  Individual  Patient  Data  System  (IPDS) 
data  which  are  centrally  maintained  by  the  Health  Care  System  Support  Activity 
(HCCSA)  at  Fort  Sam  Houston,  Texas.  Although  PAS&BA  depends  on  HCSSA  for  com¬ 
puter  processing,  many  general  retrieval  programs  have  been  written  by  statisti¬ 
cians  assigned  to  the  activity.  General  retrieval  programs  form  the  basis  for  a 
full  range  of  information  retrieval  and  biostatistical  analysis  capabilities 
covering  both  descriptive  and  inferential  techniques.  Retrieval  techniques  have 
primarily  been  employed  to  support  hospital  information  requirements.  The 
various  programs  are  used  to  provide  hospitals  with  a  wide  range  of  products 
which  include:  lists  of  inpatient  records  by  primary  diagnoses  helpful  in 
evaluating  a  hospital's  case  mix;  length  of  stay  data  by  diagnositc  code  groups 
for  internal  utilization  review  and  monitoring;  and  comparative  length  of  stay 
studies  intended  to  help  Medical  Treatment  Facilities  (MTF)  profile  their  data 
within  peer  groups. 

Fundamental  to  many  of  the  IPDS  based  analyses  of  hospitals  has  been  the 
issue  of  determining  which  hospitals  could  be  labelled  "similar"  to  the  hospital 
under  examination.  The  issue  of  determining  which  hospitals  were,  in  fact, 
similar  to  others  was  given  only  limited  examination  until  two  years  ago  when  it 
became  apparent  that  MTF  were  requesting  more  analyses  focusing  on  normative/ 
comparative  length  of  stay  issues  within  peer  group  hospitals.  The  most  common 
criteria  for  grouping  hospitals  were  those  created  by  resources  management 


staff.  Other  groups  were  defined  by  type  of  population  supported  or  mission, 
e.g..  Training  and  Doctrine  Command  ( TRADOC )  posts  which  have  high  trainee 
hospitalization  rates  or  tertiary,  acute  care  facilities  which  treat  a  high  pro¬ 
portion  of  retirees  and  their  dependents.  Groupings  were  also  formed  on  the 
basis  of  bed  size,  dispositions  and  total  patient  days.  None  of  these  groupings 
were  sufficiently  sensitive  to  case  mix  variations,  and  the  resulting  analyses 
using  diagnostic  data  for  length  of  stay  studies  produced  relatively  large 
unexplained  variations  within  each  group.  The  desire  to  improve  the  homogeneity 
of  the  groups  led  PAS&BA  to  initiate  research  in  two  areas.  The  first  area  was 
to  investigate  the  availability  of  some  type  of  patient  classification  technique 
that  would  be  better  than  using  hospital  structural  criteria  as  a  proxy  for  case 
mix  and  secondly,  to  employ  an  appropriate  patient  classification  technique  in 
order  to  understand  the  impact  of  case  mix  on  quality  assurance  and  utilization 
review  issues  among  MTF.  Both  of  these  pursuits  have  been  encompassed  within 
the  scope  of  the  Army  Medical  Department  (AMEDD)  Performance  Measurement  Study 
(PMS)  along  with  other  important  issues  helpful  in  explaining  hospital  and 
health  system  performance. 

As  a  point  of  departure,  a  study  was  generated  to  explore  the  changes  in 
case  mix  and  length  of  stay  over  the  last  ten  years  among  AMEDD  inpatient  data. 
This  analysis  was  done  using  International  Classification  of  Diseases,  Ninth 
Revision  ( I  CD-9 )  code  groups  to  help  define  case  mix.  The  analysis  of  the  IPDS 
data  base  and  related  resource  measures  from  the  Medical  Summary  Reporting 
System  indicated  that  if  our  workload  performance  measures  were  more  sensitive 
to  the  type  of  disposition  and  clinic  visit  and  less  dependent  upon  bed  day 
measures,  the  workload  picture  for  the  AMEDD  would  be  brighter.  Length  of  stay 
among  AMEDD  MTF  has  steadily  declined  during  the  last  five  years  (FY79-83). 

This  conclusion  was  reached  after  categorizing  patient  data  by  a  number  of  the 
variables  found  in  the  basic  Individual  Patient  Data  System  (IPDS)  inpatient 
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record.  The  pattern  of  decreased  length  of  stay  parallels  the  pattern  in  the 
civilian  sector.  One  might  conclude  that  an  emphasis  on  bed-day-type  measures 
does  not  provide  the  proper  incentives  for  MTF  Commanders  to  utilize  their 
resources  properly.  In  fact,  variations  in  length  of  stay  among  MTF  for  the 
same  disease,  controlling  for  age  and  sex,  frequently  leave  the  conclusion  that 
the  variant  length  of  stay  (LOS)  was  due  largely  to  local  command  policy  or 
other  professional  perogatives.  Moreover,  the  emphasis  on  establishing  an 
effective  ambulatory  care  delivery  system  demands  an  alternative  to  the  current 
outpatient  weighting  system.  The  outpatient  weighting  system,  like  the  inpa¬ 
tient  weights,  are  not  sensitive  to  the  type  of  case  and  thus  fail  to  reflect 
tne  actual  resources  and  provider  time  expended  in  the  delivery  of  care. 

The  IPOS  review  revealed  that  our  MTF  are  hospitalizing  more  patients  in 
older  age  groups  and  that  these  older  patients  are  becoming  increasingly  preva- 
|  \;r.r  in  IPOS  over  time  (See  Figure  1).  In  a  related  examination,  viewing  IPOS 

in  terms  of  the  beneficiary  category  of  patients  hospitalized,  patients  defined 
as  "other"  (for  the  purposes  of  this  analysis,  "other"  category  accumulated  all 
j  patient  beneficiary  categories  not  accounted  for  as  active  duty  military  or 

•  dependents  of  active  duty  military)  represented  an  increasingly  significant  pro¬ 

portion  of  the  inpatient  data  over  the  duration  of  the  study  (See  Figure  2). 
using  some  of  the  initial  IPOS -MG  data,  a  correlation  was  computed  with  the 
case  mix  index  and  the  percent  of  "ether"  patients.  For  FY  82,  the  MTFs  were 
grouped  in  three  categories;  MEOCENs,  large  MTF  and  small  MTF.  The  medical  cen- 
uu'S  demonstrated  a  nearly  linear  relationship  between  case  mix  index  and  the 
percent  of  "other"  patient  category  data.  The  coefficient  of  determination  w3S 
calculated  to  be  .955  (See  Figure  3  and  4). 

4n  examination  of  selected  elements  of  the  Medical  Care  Composite  Unit 
■KIC'J)  or  Composite  Work  Unit  (CWU)  demonstrated  that  MTFs  have  actually 
increased  workload.  Both  the  number  of  patients  disposit loned  and  outpatient 


Table  portrays  percent  of  all  Inpatient  dispositions  from  US  Antv  hospitals 
by  patient  category. 
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encounters/visits  have  shown  consistenL  growth  overtime.  Perhaps  as  a  result  of 
increased  production  vis-a-vis  dispositioned  patients  and  increased  ambulatory 
care  efforts,  the  average  length  of  stay  has  dropped  signficantly  with  a 
corresponding  decline  in  average  daily  patient  load.  As  observed  by  Joyce 
Hutchins,  the  statistician  who  developed  the  IPOS  trend  analysis,  using  the 
MCCU/CWJ  measure,  it  would  appear  that  AMEDD  MTF  are  being  penalized  for  being 
more  efficient. 

Several  points  were  found  to  be  indicative  of  a  trend  in  IPDS  over  the  CY 
79-83  period  indicating  changes  in  case  mix/case  complexity  length  of  stay: 

Key  Points  with  Respect  to  Length  of  Stay 

*  An  increase  in  the  number  and  percent  of  female  patients  treated 
coupled  with  their  consistently  shorter  LOS. 

*  An  increase  in  the  number  of  cases  and  percents  of  dispositions 
accumulated  to  OB/GYN  and  Pediatric  clinic  services  combined. 

*  A  decrease  in  the  number  and  percent  of  injury  cases  and  their 
typically  longer  lengths  of  stay. 

*  A  decrease  in  dispositions  for  failure  to  meet  medical  procure¬ 
ment  standards  (these  cases  are  primarily  handled  on  an  outpatient 
basis  now). 


Key  Points  with  Re sp ect  to  Increa sin g  Complexity 


t 

★ 

There  are  more  diagnoses  being  coded  per  IPDS  record. 

i 

* 

There  has  been  a  slight  increase  in  the  number  of  deaths  among 
inpatients  . 

i 

• 

★ 

The  number  and  percent  of  inpatients  ...  eated  who  are  fifty 
years  of  age  and  older  has  increased. 

i 

- 

★ 

The  number  and  percent  of  active  duty  military  patients  has 
decreased. 

| 

★ 

Average  length  of  stay  has  increased  progressively  by  age 
group.  Similarly,  tne  number  of  diagnoses  and  procedures 
coded  per  record  also  increased  with  the  older  age  groups. 

< 

* 

f 

★ 

While  hospital  stays  for  lower  enlisted  have  decreased, 
average  length  of  stay  for  more  senior  ranks  increased. 

• 

►’ 

•  -  . 

Having  analyzed  the  IPDS  data  and  related  resource  measures  from  a  tradi¬ 
tional  perspective,  the  PMS  study  efforts  began  examining  the  applicability  of 
of  the  Diagnosis  Related  Groups  to  AMEDD  inpatient  Data.  This  effort  caused 
the  study  to  address  a  major  nosological  issue.  The  DRGs  currently  used  by 
HHS,  HCFA  were  formed  using  codes  documented  in  the  International  Classifica¬ 
tion  of  Diseases,  Ninth  Revision  with  Clinical  Modification  (ICD-9-CM).  The 
civilian  hospital  industry  and  the  Veterans  Administration  adopted  this  coding 
classification  system  in  1979.  Conversely,  the  three  Services  elected  to  use 
the  I  CD-9  and  International  Classification  of  Procedures  in  Medicine  (ICPM)  for 
diagnoses  and  procedures,  respectively.  The  Services  implemented  ICD-9  and  ICPM 
coding  in  calendar  year  1980.  Therefore,  the  first  problem  encountered  in  the 
study  effort  was,  "how  to  convert  our  AMEDD  data  to  the  coding  system  (ICD-9-CM) 
that  was  compatable  with  the  Health  Systems  International  Grouper."  The 
"grouper"  is  the  software  used  to  assign  individual  patient  records  to  one  of 
the  470  DRGs.  Before  examining  the  initial  products  of  our  DRG/case  mix  analy¬ 
ses,  we  will  review  the  methodology  used  to  convert  and  group  Army  inpatient 
data . 

CONVERSION  AND  GROUPING  OF  ARMY  INPATIENT  DATA 
Measuring  performance  in  inpatient  care  is  very  difficult  without  some 
benchmark  of  comparison.  Since  the  Department  of  Defense  medical  departments 
use  ICD-9  and  ICPM  for  their  medical  coding  and  civilian  and  Veterans  Admin¬ 
istration  health  care  facilities  code  by  ICD-9-CM,  it  is  very  difficult  for  the 
services  to  make  comparisons  other  than  with  other  military  hospitals.  It  was 
recognized  that  some  type  of  diagnoses  grouping  for  performance  measurement  was 
necessary.  Two  major  efforts  have  been  made  in  this  direction.  The  first 
effort  was  directed  at  finding  an  appropriate  and  acceptable  method  for  con¬ 
verting  ICD-9  and  ICPM  diagnoses  and  surgical  procedures  into  ICD-9-CM.  If  such 


a  map  were  possible,  then  that  would  enable  Army  hospitals  not  only  to  compare 
their  data  with  nonmilitary  medical  treatment  facilities,  but  also  to  use  cer¬ 
tain  state-of-the-art  grouping  techniques  to  facilitate  case  mix  measurement. 

The  code  conversion  map  used  in  the  study  group's  initial  efforts  was  one  devel¬ 
oped  by  Mr.  Kay,  from  the  Naval  School  of  Health  Sciences  (NSHS)  in  conjunction 
with  the  December  1981  version  Health  Systems  International  Grouper  Program  for 
assigning  records  to  Diagnoses  Related  Groups  (DRGs). 

A  DRG  data  base  was  created  for  the  study  group  using  the  last  three  fiscal 
years  (FY  81-FY  83)  of  inpatient  records  from  US  Army  hospitals  Worldwide.  This 
amounted  to  approximately  1.2  million  records.  The  records  of  active  duty  Army 
personnel  treated  in  civilian  hospitals  for  their  entire  period  of  hospitaliza¬ 
tion  were  not  included  in  this  data  base;  neither  were  incomplete  records,  i.e., 
records  with  clinical  data  missing,  nor  Carded  for  Record  Only  (CRO)  records. 

The  Army  inpatient  record  is  240  characters  in  length.  It  contains  the  usual 
demographic  data  found  on  most  hospital  abstracts,  such  as  sex,  age,  and  race, 
along  with  up  to  eight  diagnoses  and  eight  different  surgical  procedures. 

Certain  data  elements  unique  to  DOD  or  Army  are  collected,  such  as  convalescent 
leave  days,  supplemental  care  days,  external  cause  of  injury  and  underlying 
cause  of  separation  for  disability  separations.  Individual  Patient  Data  System 
( I PDS )  records  are  edited  extensively  before  being  added  to  the  data  base, 
consistency  checks  are  made  between  fields,  as  well  as  quality-type  edits.  Age 
or  sex-related  diagnoses  are  verified.  Admission,  disposition  data  sequencing 
and  computations  of  days  fields  (bed,  sick,  convalescent  leave,  supplemental 
care,  cooperative  care,  other)  are  checked.  By  the  time  a  record  is  through  the 
editing  process,  it  is  error-free  as  much  as  is  possible  with  machine  editing. 
This  data  base  was  used  as  input,  to  a  FORTRAN  program  that  essentially  copied 
the  240  characters  of  the  IPOS  record  and  added  an  additional  112  characters, 
making  toe  ORG  record  382  characters  in  length 


The  I C 0 - 9  diagnoses  and  ICPM  procedures  were  converted  to  ICD-9-CM  codes 
using  the  Navy  map  mentioned  above.  This  map  contained  approximately  170 
diagnosis  codes  and  650  procedure  codes.  In  the  conversion  program,  if  an 
equivalent  code  were  not  found  in  the  map,  then  a  0  was  added  to  the  diagnosis 
to  make  a  5-digit  IC0-9-CM  diagnosis  code.  If  an  equivalent  code  were  not  found 
in  the  procedure  map,  then  that  particular  procedure  was  not  converted.  Those 
converted  diagnosis  and  procedure  codes  were  then  output  onto  the  new  record. 

The  program  also  converted  age  to  a  3-character  field,  sex  to  a  numeric 
1-character  field,  and  the  IPOS  disposition  (or  discharge)  to  a  1-digit  numeric 
code.  In  an  effort  to  render  the  best  possible  conversion  map,  it  was  learned 
that  The  Health  Systems  Management  Group,  School  of  Organization  and  Management, 
Yale  University  were  doing  a  similar  conversion  for  international  comparisons 
using  DRGs .  In  January,  communication  began  with  the  Yale  staff  and  a  contract 
was  negotiated  with  Commission  on  Professional  and  Hospital  Activities  (CPHA) 
for  nosological  expertise  and  recommendations  on  the  map.  In  March,  Mr.  Terry 
Kay  from  NSHS,  LTC  Cecere,  M.D.,  and  COL  Rosenberg,  M.D.  of  the  PMS,  and  the 
authors  met  and  corrected  recognizable  deficiencies  in  the  procedure  map. 
Suggestions  and  recommendations  were  received  from  Yale  University  (for  both 
diagnoses  and  procedures)  and  CPHA  (for  procedures  only).  In  many  cases  Yale 
and  CPHA  recommendations  were  different.  Some  of  their  suggestions  were 
integrated  into  the  map;  others  are  still  being  evaluated.  Changes  from 
the  81  Grouper  to  the  83  Grouper  program  improved  the  grouping  substantially. 
Some  changes  corrected  errors  in  the  81  program  logic,  while  others  improved 
the  groupings.  A  summary  of  the  changes  implemented  with  1983  Grouper  are  as 
fol 1 ows : 

a.  Thirteen  procedure  codes  were  changed  from  "OR"  to  "N0N-0R." 

b.  Five  procedure  codes  were  changed  from  "N0N-0R"  to  "OR." 

c.  Procedure  codes  were  added  to  additional  DRGs  as  indicated  in  Table  1. 
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Table  1 


PROCEDURE  CODE  CHANGES  FROM  81  TO  83  GROUPER 


MDC 

#  PROCEDURE  CODES  ADDED 

DRG  ADDED  TO 

01 

38 

7-8 

05 

20 

120 

06 

2 

154-156 

06 

1 

152-153 

06 

1 

157-158 

07 

1 

193-194 

08 

1 

213 

08 

1 

226-227 

08 

7 

233-234 

09 

1 

267 

09 

3 

269-270 

10 

1 

292-293 

11 

1 

303-305 

13 

3 

360 

13 

3 

363-364 

14 

1 

375 

14 

1 

381 

16 

1 

394 

21 

16 

442-443 

22 

37 

459 

d.  Some  diagnosis  codes  were  changed  between  MDCs: 

2  diagnosis  codes  previously  assigned  to  DRG  469  were  added  to  MDC  16. 

1  diagnosis  code  was  added  to  MDC  14. 

e.  The  grouper  program  was  corrected  to  allow  Newborns  with  no  secondary 
diagnosis  to  be  assigned  to  DRG  391,  Normal  newborns. 

f.  The  surgical  DRG  hierarchy  for  MDCs  6  and  8  was  changed. 

g.  Two  hundred  sixty-nine  codes  were  added  to  the  complications/comorbidity 
list,  and  46  codes  were  eliminated  from  the  list.  One  hundred 
fifty-one  of  the  codes  added  were  nature  of  injury  codes. 

h.  The  coding  of  discharge  status  was  changed  to  the  UB-82  convention, 
using  the  same  categories  of  discharges  but  changing  the  codes  to  two- 
digit  codes. 

i.  Detailed  listings  of  the  changes  are  found  in  Appendix  H  to  the  Manual, 
The  Revised  ICD-9-CM  Diagnos i s  Related  Groups  (DRGs) . 

Use  of  the  83  version  of  the  Grouper  began  in  mid-April  1984.  In  analyzing 

the  newly  grouped  data,  the  distinction  desired  between  types  of  hospitals  was 


not  achieved.  A  change  was  made  to  the  conversion  program  to  not  convert  proce¬ 
dures  that  had  been  performed  at  another  hospital  (on  transfer  records).  This 
slightly  improved  the  differentiation  between  types  of  hospitals.  CPHA  recom¬ 
mendations  included  the  mapping  of  all  "class  1"  procedures,  which  they  define 
as  surgery.  Some  of  these  had  been  included  in  the  previous  conversion  map 
used,  and  some  had  not.  About  the  beginning  of  May  the  map  was  expanded  to 
include  verbatim  addition  of  the  CPHA  recommendations  concerning  procedures. 

This  change  seemed  to  improve  the  groupings  by  DRG  more  than  any  other  single 
modification  up  to  that  point.  Evaluation  of  Yale  recommendations  for  the 
diagnosis  and  procedure  maps  still  remains  to  be  done.  Also,  CPHA  is  expected 
to  submit  their  recommendations  for  the  diagnosis  map  in  the  near  future  pro¬ 
viding  the  capstone  event  for  this  major  area.  It  is  hoped  that  a  satisfactory 
map  can  be  derived  that  will  successfully  and  accurately  assign  records  to  the 
DRG  they  would  have  been  grouped  to  had  the  data  been  originally  coded  by  the 
ICD-9-CM  classification. 

Once  the  "best  map"  has  been  attained,  analysis  will  be  made  as  to  the 
accuracy  of  that  map.  Table  2  shows  changes  to  the  first  few  DRGs  through  the 
various  mapping  and  grouper  changes.  One  Air  Force  site  has  coded  a  sample  of 
primary  inpatient  records  using  ICD-9-CM,  and  has  coded  the  same  records  using 
I  CD  -  9  and  ICPM.  Plans  for  future  work  with  Wilford  Hall  Air  Force  Medical 
Center  in  analyzing  the  results  of  this  dual  coding  experiment  should  aid  in 
validation  of  the  diagnosis  and  procedure  maps.  All  through  the  mapping  pro¬ 
cess,  case  mix  indices  (CMIs)  have  been  computed  using  the  HCFA  weights  as  com¬ 
puted  for  Medicare,  the  1982  Schedule  of  Reimbursement  used  for  New  Jersey  and 
the  1984  reimbursement  schedule  for  the  Veterans  Administration  hospitals 
(Nashville).  CMIs  using  each  of  these  weighting  systems  were  computed  for  a 
sample  of  records  from  the  FY  81  file  (3,859  records),  then  later  for  each  of 
the  three  fiscal  yea^s  (FY  81-FY  83).  Each  iteration  of  the  map  seemed  to 


Table  2 

EFFECT  OF  CHANGES  ON  FY  83  DRG  FREQUENCY 


DRG 

81  GROUPER 

83  GROUPER 

83  GROUPER 
EXCL  SURG 

AT  OTHER  MTF 

83  GROUPER 
EXCL  SURG 

AT  OTHER  MTF 
INCL  CLASS  1 

001 

256 

256 

236 

251 

002 

134 

134 

85 

85 

003 

128 

128 

126 

127 

004 

110 

110 

87 

88 

005 

242 

242 

241 

241 

006 

0 

0 

0 

1028 

007 

133 

140 

131 

54 

008 

1227 

1273 

1268 

310 

009 

135 

137 

158 

158 

010 

101 

101 

104 

104 

Oil 

133 

133 

140 

140 

012 

457 

461 

463 

456 

013 

336 

338 

338 

338 

014 

1078 

1088 

1102 

1103 

015 

699 

699 

701 

699 

016 

80 

82 

83 

76 

017 

185 

186 

186 

174 

018 

148 

142 

144 

146 

019 

918 

928 

934 

933 

020 

831 

832 

833 

833 

021 

448 

448 

448 

449 

022 

6 

6 

6 

6 

023 

54 

54 

54 

55 

024 

408 

369 

369 

373 

025 

2150 

2193 

2194 

2190 

026 

1061 

1060 

1062 

1062 

027 

0 

0 

0 

0 

028 

534 

598 

654 

668 

029 

1883 

1947 

1977 

1975 

030 

876 

889 

891 

893 

improve  the  CMI,  with  the  exception  of  the  Yale  recommendations.  Using  their 
map  exclusively  produced  the  lowest  set  of  MTF  CMIs  of  all  the  CMIs  computed. 
Table  3  shows  the  changes  in  CMI  through  some  of  the  various  mappings  using 
Medicare  weights. 

Another  type  of  map  developed  was  an  ICD-9  conversion  to  CPHA's  "List  A 
Groups".  This  was  done  by  Army  to  be  used  for  conversion  of  IPOS  data  to  List  A 
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Table  3 

EFFECT  OF  CHANGES  ON  FY  83 
AOJUSTED  CASE  MIX  INDEX 


MTF 

81  GROUPER 

83  GROUPER 

83  GROUPER 
EXCL  SURG 

AT  OTHER  MTF 

83  6R0UPER 
EXCL  SURG 

AT  OTHER  MTF 
W/MOD  MAP 

LETTERMAN  AMC 

1.0285 

1.0295 

1.0264 

1.0669 

BROOKE  AMC 

1.0108 

1.0143 

1.0106 

1.0483 

WALTER  REED  AMC 

0.9673 

0.9668 

0.9625 

0.9935 

FITZSIMONS  AMC 

0.8830 

0.8810 

0.8795 

0.9059 

EISENHOWER  AMC 

0.8924 

0.8960 

0.8627 

0.8955 

WM  BEAUMONT  AMC 

0.8099 

0.8081 

0.7970 

0.8093 

REDSTONE  ARSENAL 

0.8251 

0.8072 

0.7981 

0.7968 

FT  MEADE 

0.8072 

0.7957 

0.7927 

0.7962 

FT  DEVENS 

0.7987 

0.7901 

0.7830 

0.7852 

MAD I GAN  AMC 

0.7696 

0.7689 

0.7643 

0.7742 

TRIPLER  AMC 

0.7416 

0.7440 

0.7368 

0.7589 

FT  EUSTIS 

0.7518 

0.7474 

0.7430 

0.7479 

LANDSTUHL,  GE 

0.7239 

0.7223 

0.7223 

0.7461 

FT  MONMOUTH 

0.7475 

0.7501 

0.7478 

0.7455 

SEOUL,  KOREA 

0.7522 

0.7422 

0.7411 

0.7370 

FT  BRAGG 

0.7258 

0.7262 

0.7229 

0.7232 

FT  DIX 

0.7232 

0.7244 

0.7228 

0.7229 

FT  MCCLELLAN 

0.7258 

0.7226 

0.7213 

0.7206 

FT  BEN  HARRISON 

0.7489 

0.7214 

0.7169 

0.7169 

FT  LEE 

0.7244 

0.7163 

0.7124 

0.7151 

FT  BENNING 

0.7216 

0.7207 

0.7119 

0.7146 

GORGAS,  CZ 

0.7078 

0.7077 

0.7068 

0.7110 

FT  LEAVENWORTH 

0.7199 

0.7190 

0.7073 

0.7107 

FT  IRWIN 

0.6943 

0.6898 

0.6898 

0.6886 

FT  STEWART 

0.6744 

0.6771 

0.6745 

0.6818 

FT  JACKSON 

0.6786 

0.6809 

0.6800 

0.6801 

FT  LEONARD  WOOD 

0.6845 

0.6839 

0.6799 

0.6789 

WEST  POINT 

0.6684 

0.6677 

0.6677 

0.6771 

AUGSBURG,  GE 

0.6751 

0.6755 

0.6731 

0.6751 

FT  KNOX 

0.6809 

0.6744 

0.6694 

0.6702 

FT  HUACHUCA 

0.6790 

0.6722 

0.6720 

0.6700 

FT  POLK 

0.6633 

0.6658 

0.6631 

0.6630 

FT  CARSON 

0.6635 

0.6643 

0.6632 

0.6623 

FT  ORD 

0.6602 

0.6618 

0.6598 

0.6607 

FT  RUCKER 

0.6649 

0.6599 

0.6599 

0.6563 

FT  CAMPBELL 

0.6599 

0.6581 

0.6524 

0.6552 

FT  HOOD 

0.6613 

0.6577 

0.6552 

0.6550 

FRANKFURT,  GE 

0.6536 

0.6535 

0.6521 

0.6515 

FT  SILL 

0.6513 

0.6529 

0.6451 

0.6497 

FT  RILEY 

0.6427 

0.6451 

0.6429 

0.6456 

■'^T-  ■  . - 
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Table  3  (continued) 

EFFECT  OF  CHANGES  ON  FY  83 
ADJUSTED  CASE  MIX  INDEX 


MTF 

81  GROUPER 

83  GROUPER 

83  GROUPER 
EXCL  SURG 

AT  OTHER  MTF 

83  GROUPER 
EXCL  SURG 

AT  OTHER  MTF 
W/MOD  MAP 

BAD  CANNSTATT,  GE 

0.6419 

0.6391 

0.6373 

0.6449 

WUERZBURG,  GE 

0.6423 

0.6383 

0.6335 

0.6338 

BREMERHAVEN,  GE 

0.6309 

0.6232 

0.6140 

0.6297 

ALASKA 

0.6258 

0.6250 

0.6238 

0.6244 

NUERNBERG,  GE 

0.6361 

0.6222 

0.6212 

0.6234 

VICENZA,  IT 

0.6107 

0.6157 

0.6113 

0.6113 

SHAPE,  BELGIUM 

0.6186 

0.6114 

0.6088 

0.6105 

LEGHORN,  IT 

0.6427 

0.5981 

0.5974 

0.6043 

FT  BELVOIR 

0.5980 

0.5984 

0.5984 

0.5977 

HEIDELBERG,  GE 

0.5910 

0.5900 

0.5890 

0.5889 

BERLIN,  GE 

0.5838 

0.5835 

0.5774 

0.5807 

Groups  for  a  second  analysis  employing  another  case  mix  measure  referred  to  as 
the  Resource  Need  Index  (RNI ) .  The  RN I  is  a  CPHA  measure  that  addresses 
expected  resource  variation  based  on  case  mix.  With  this  "List  A"  map,  the  Army 
has  gained  the  ability  to  compare  AMEDO  data  with  CPHA's  national  normative 
data.  CPHA's  data  is  from  a  large  sample  of  Professional  Activities  Study  (PAS) 
hospitals  and  reports  LOS  by  List  A  Groups.  The  variables  included  within  List 
A  Groups  are:  whether  or  not  surgery  was  performed,  presence  or  absence  of 
secondary  diagnosis,  and  five  age  groups.  The  variance  and  percentiles  (5th, 
10th,  75th,  90th,  95th,  and  99th)  are  also  reported.  Since  1980  when  DOD  began 
coding  from  I C 0-9  and  civilian  hospitals  were  using  ICD-9-CM,  very  little  com¬ 
parison  could  be  made  between  DOD  and  their  civilian  counterparts.  A  map  from 
ICD-9  to  List  A  Groups  now  gives  DOD  that  capability  of  comparison,  and  the  Army 
expects  to  do  more  in  this  area  of  analysis. 
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INITIAL  CASE  MIX  ACCOUNTING 

The  current  edition  of  Diagnosis  Related  Groups  establishes  twenty-three 
Major  Diagnostic  Categories  (MDCs).  Table  4  provides  an  english  description  of 
each  MDC .  The  MDCs  were  formed  by  physician  panels  as  the  first  step  toward 
insuring  that  the  DRGs  would  be  clinically  coherent.  The  diagnoses  in  each  MDC 
correspond  to  a  single  organ  system  or  etiology  and  in  general  are  associated 
with  a  particular  medical  specialty.  MDCs  14  and  15  contain  the  greatest 
amount  of  Army  data  representing  Pregnancy,  Childbirth  and  the  Puerperium  and 
Newborns  and  Other  Neonates  with  Conditions  Originating  in  the  Perinatal  Period, 
respectively.  A  graphic  presentation  of  the  frequency  of  AMEDD  inpatient  data 
by  Major  Diagnostic  Category  for  FY  1983  is  at  Figure  5.  Correspondingly,  a 
review  of  the  top  twenty  Diagnoses  Related  Groups  shows  that  DRGs  391  and  373, 
representing  Normal  Newborns  and  Vaginal  Delivery  without  Complicating 
Diagnoses,  respecti vely,  have  the  highest  frequency  (See  Figure  6).  It  should 
be  noted  that  the  MDC  frequency  graphic  includes  data  for  MDC  24.  MDC  24  is 
not  an  actual  MDC,  as  described  above,  but  instead  reflects  the  data  assigned 
to  DRGs  469  and  470  which  are,  in  effect,  catch-all  DRGs  where  data  did  not  fit 
the  delineation  of  DRGs  1  through  468. 

Case  mix  information  has  been  created  using  HCFA  established  relative 
weight  units  for  fiscal  years  1981-1983.  Case  mix  data  prepared  for  these  pro¬ 
ceedings  represent  FY  1983  data.  Figures  7,  8,  and  9  display  MTF  Case  Mix 
Indexes  for  the  three  types  of  MTF  groups  commonly  known  within  the  US  Army 
Health  Services  command  as  MEDCENS  (Medical  Centers),  Large  MTF  and  Small/Medium 
MTF.  The  following  definitions  may  be  helpful  in  explaining  the  concept  asso¬ 
ciated  with  case  mix.  Case  Mi x  is  defined  as  the  relative  proportion  of  cases 
that  fall  into  mutually  exclusive  case  types  (e.g.,  DRGs).  Case  Mix  Measures 
are  calculated  by  application  of  a  set  of  weights  to  the  number  of  patients 
falling  into  each  case  type  within  a  hospital.  Each  case  type  (e.g.,  DRG) 
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Table  4 


LISTING  OF  TWENTY-THREE  MAJOR  DIAGNOSTIC  CATEGORIES 


Major  Diagnostic 
Cateogry  Number 


Major  Diagnostic  Category 
English  Description 


Diseases  and  Disorders  of  the  Nervous 
System 

Diseases  and  Disorders  of  the  Eye 
Diseases  and  Disorders  of  the  Ear,  Nose 
and  Throat 

Diseases  and  Disorders  of  the  Respiratory 
System 

Diseases  and  Disorders  of  the  Circulatory 
System 

Diseases  and  Disorders  of  the  Digestive 
System 

Disease  and  Disorders  of  the  Hepatobiliary 
System  and  Pancreas 

Disease  and  Disorders  of  the  Musculoskeletal 
System  and  Connective  Tissue 
Disease  and  Disorders  of  the  Skin,  Subcutaneous 
Tissue  and  Breast 

Endocrine,  Nutritional,  and  Metabolic 
Diseases  and  Disorders 
Diseases  and  Disorders  of  the  Kidney  and 
Urinary  Tract 

Diseases  and  Disorders  of  the  Male 
Reproductive  System 
Diseases  and  Disorders  of  the  Female 
Reproductive  System 

Pregnancy,  Childbirth  and  the  Puerperium 
Newborns  and  Other  Neonates  with  Conditions 
Originating  in  the  Perinatal  Period 
Diseases  and  Disorders  of  the  Blood  and 
Blood-forming  Organs  and  Immunological 
Disorders 

Myeloproliferative  Diseases  and  Disorders 
and  Poorly  Di f ferenti ated  Neoplasms 
Infectious  and  Parasitic  Diseases  (Systemic  or 
Unspecified  Sites) 

Mental  Diseases  and  Disorders 
Substance  Use  and  Substance  Induced  Organic 
Mental  Disorders 

Injuries,  Poisonings  and  Toxic  Effects  of 
Drugs 
Burns 

Factors  Influencing  Health  Status  and 
Contacts  with  Health  Services 


Figure 


FREQUENCY  OF  DIAGNOSTIC  RELATEO  GROUPS 


IGHES 
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receives  a  distinct  weight,  constant  for  all  hospitals,  that  represents  the 
relative  hospital  resource  utilization  of  an  average  or  ideal  case  that  falls 
into  that  case  type.  In  this  case,  the  weights  employed  were  those  published  by 
HCFA.  The  relative  weight  products  are  summed  and  divided  by  the  total  fre¬ 
quency  of  patient  dispositions.  The  quotient  results  in  a  case  mix  index.  The 
dimension  of  complexity  that  is  estimated  in  any  given  case  mix  measure  is 
determined  by  the  set  of  weights  that  is  used  to  estimate  the  relative  expected 
patient  resource  consumption.  Yet  without  validation  of  these  weights  to  AMEDD 
data,  they  should  be  considered  prel iminary  and  subject  to  further  study. 

Notwithstanding,  health  services  researchers  have  demonstrated  that  case 
mix  usually  accounts  for  the  major  variations  in  hospital  costs  and  performance. 
The  contribution  these  studies  have  made  to  understanding  hospital  organiza¬ 
tional  behavior  led  PAS&BA  and  the  AMEDD  Performance  Measurement  Study  to 
examine  the  utility  of  case  mix  measurement  data  in  explaining  AMEDD  MTF 
Performance. 

The  case  mix  reports  that  have  been  developed  thus  far  primarily  reflect 
the  distribution  of  hospital  data  across  DRGs  and  provide  a  summary  report  that 
describes  inpatients,  the  relative  weight  distribution,  and  the  case  mix  index. 
The  following  text  portrays  the  key  elements  of  the  summary  report  and  suggested 
interpretation.  The  narrative  is  illuminated  by  graphic  displays  drawn  from 
data  from  two  medical  centers  (MEDCENs)  and  two  primary  care  hospitals  (MEDDACs) 
(See  Figures  10-13).  The  section  concludes  with  a  complete  copy  of  an  actual 
hospital  DRG/Case  Mix  profile  report  (Table  5). 

There  are  two  case  mix  index  figures  provided  in  the  attached  displays, 
the  unadjusted  and  the  adjusted.  The  adjusted  case  mix  is  the  more  valuable 
of  the  two  indexes  in  that  patient  records  assigned  to  DRGs  469  and  470  have 
been  eliminated.  These  were  eliminated  because  the  relative  weight  value  for 
each  is  0.0.  Civilian  hospitals  with  data  in  these  DRGs  would  review  these 
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records  in  hopes  of  having  them  regrouped  to  more  definitive  DRGs.  Therefore, 
we  have  computed  the  adjusted  case  mix  index  without  using  records  assigned  to 
DRG  469  or  470.  The  "total  count  adjusted  for  DRG  cells  469  and  470"  reveals 
the  number  of  IPOS  dispositions  assigned  to  DRGs  1  through  468.  This  count 
states  the  number  of  records  included  in  the  calculation  of  the  "adjusted"  case 

mix  index.  The  distribution  of  patients  across  Relative  Weights  Quartiles 

describes  the  MTF  workload  in  terms  of  "relative  weight  quartiles"  by  reporting 
the  count  and  per  cent  of  workload  developed  by  quartile.  This  chart  reflects 
the  Health  Care  Financing  Administration  (HCFA)  established  relative  weights 
sorted  and  arranged  by  quartiles  with  the  1st  quartile  reflecting  the  DRG  with 
the  lowest  relative  weight  value  through  the  DRG  with  a  relative  weight  value  of 
0.6291  and  so  on  for  each  quartile.  This  module  may  be  used  to  explain  the  high 
or  low  value  of  the  case  mix  index  by  displaying  the  concentration  of  disposi¬ 
tions  in  DRGs  with  low  or  high  value  relative  weights.  As  an  example,  if  60% 
of  a  hospital's  data  fell  into  the  first  quartile,  it  would  seem  reasonable  to 

expect  a  fairly  low  case  mix  index  which  may  be  further  interpreted  to  mean  a 

relatively  low  average  case  complexity  for  the  hospital.  Conversely,  a  higher 
case  mix  index  could  be  anticipated  if  an  MTF  had  a  relatively  even  distribution 
of  data  across  each  of  the  relative  weight  quartiles.  The  patient  category 
distribution  module  describes  an  MTF  in  terms  of  general  patient  beneficiary 
categories  using  IPDS  data  elements.  This  adds  to  the  interpretation  of  the 
case  mix  index  value  by  alluding  to  the  types  of  diseases  or  injuries  one  could 
anticipate  based  upon  the  activity  level  or  demographic  features  frequently 
associated  with  a  given  segment  of  the  MTF  catchment  area  population. 

The  age  graphic  displays  some  critical  features  of  an  MTF's  inpatient  popu¬ 
lation  by  showing  the  data  by  category  with  the  respective  percent  of  data 
attributed  to  each  age  cell.  This  further  facilitates  explanation  of  the  case 
mix  index.  In  general,  the  younger  the  population  represented  in  the  inpatient 


data,  the  less  complex  the  diseases  and  injuries  treated.  Conversely,  the 
greater  the  percent  of  patients  in  the  older  age  cells,  the  higher  the  case  mix 
index  because  of  the  presence  of  complicating  and/or  comorbid  conditions. 

A  hospital  is  considered  to  have  treated  patients  with  an  average  case 
complexity  when  the  case  mix  index  is  1.00.  This  figure  represents  the  average 
case  mix  index  for  hospitals  in  the  private  sector  based  on  calculations  made 
from  1981  cost  and  billing  data  published  in  the  1  September  1983  Federal 
Register.  (FR  48:  171).  Specific  average  case  mix  indexes  by  hospital  classi¬ 
fication  group  appear  in  Table  3b  FR  48  page  39871.  The  average  case  mix 
complexity  figures  should  not  be  considered  as  a  viable  comparison  to  AMEDD  MTF 
for  several  reasons:  1)  our  CMIs  reflect  all  patient  categories  (i.e.,  not 
just  Medicare  eligible  patients);  2)  because  of  our  mission,  AMEDD  MTF  treat  a 
young  active  duty  population  as  the  first  priority  for  care;  and,  3)  the  nuances 
of  patient  classification  coupled  with  preliminary  status  of  the  coding  conver¬ 
sion  process  could  tend  to  misdirect  or  malassign  AMEDD  patient  data  causing  the 
DRG  assignment  to  be  adversely  affected. 

Pearson  correlation  coefficients  computed  across  each  of  the  three  relative 
weight  systems  previously  discussed  resulted  in  the  following:  HCFA  weights  and 
NJ  average  cost  per  DRG  data  from  the  1982  Schedule  of  Reimbursement  produced 
an  r  value  of  0.9828.  The  HCFA  correlation  with  the  VA  (Nashville)  costs  per 
DRG  resulted  in  an  r  value  of  0.8964.  New  Jersey  weights  correlated  with  the  VA 
(Nashville)  weights  produced  an  r  value  of  0.9265.  Based  upon  the  close  rela¬ 
tionship  between  the  relative  weights  and  the  potential  comparability  with 
civilian  hospitals,  the  PMS  DRG  effort  chose  to  focus  on  the  HCFA  relative 
weights  for  case  mix  accounting. 

DIRECTION  OF  FUTURE  RESEARCH 

An  area  with  great  potential  to  explain  and  perhaps  predict  resource  con¬ 
sumption  is  the  development  of  linkages  between  UCA  at  the  3  digit  level  ( e . g . , 


246 


AAA)  and  IPDS  clinic  service.  Reports  with  mean  clinic  service  case  mix 
indexes  have  demonstrated  variation  within  MTF  across  clinic  services  as  well 
as  across  MTF  for  the  same  clinic  service. 

The  resolution  of  nosological  issues  will  remain  at  the  forefront  of  our 
research  until  we  obtain  closure  vis-a-vis  the  "best"  map.  The  effectiveness 
and  appropriateness  of  converting  ICD-9/ICPM  data  to  ICD-9-CM  is  central  to 
gaining  an  accurate  DRG  assignment.  A  supplementary  research  issue  will  be  the 
evaluation  and  incorporation  of  the  Wilford  Hall  AFMC  direct  ICD-9-CM  coding 
test  for  a  stratified  sample  of  that  facility's  data.  The  goal  in  this  area 
is  to  compare  the  DRG  assignment  of  records  converted  from  ICD-9  to  ICD-9-CM 
and  those  coded  directly  to  ICD-9-CM. 

Other  areas  that  are  planned  for  investigation  include:  1)  the  use  of  the 
Autogrp  system  to  reevaluate  and  possibly  re-construct  DRGs  or  patient  groups 
unique  to  the  AMEDD.  2)  Introduce  disease  staging  to  explain  variation 
further  either  within  MDC  or  within  DRG.  3)  Evaluate  the  contribution  patient 
acuity  data  may  make  in  explaining  variations  within  DRGs. 
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THE  RELATIONSHIP  BETWEEN  INPATIENT  SERVICE  COST 
AND  CASE-COMPLEXITY  AT 
WILFORD  HALL  USAF  MEDICAL  CENTER 


CAPT  Scott  A.  Optenberg,  MSC,  USAF 
CAPT  Samuel  P.  Fye,  MSC,  USAF 
CAPT  Richard  E.  Bigelow,  MSC,  USAF 
CAPT  William  D.  Haddock,  MSC,  USAF 
SSGT  Robert  F.  Ward,  USAF 

INTRODUCTION 

This  study  was  conducted  at  Wilford  Hall  USAF  Medical  Center  (WHMC)  to 
examine  the  relationship  between  complexity  of  case  mix  and  cost  of  providing 
patient  care  at  a  military  medical  facility.  WHMC,  located  at  Lackland  Air 
Force  Base,  San  Antonio  Texas,  is  the  largest  medical  center  in  the  Air  Force 
and  its  patient  population  represents  a  wide  diversity  in  complexity  of  case 
mix.  Facility  size  and  case-complexity  made  WHMC  ideal  for  examining  the  rela¬ 
tionship  between  cost  and  case-complexity. 

Sixteen  major  inpatient  services  were  selected  for  study  and  the  rela¬ 
tionship  between  service  case-complexity  and  cost  of  providing  inpatient  care 
was  examined.  Costs  were  derived  using  Uniform  Chart  of  Accounts  (UCA)  data. 
Since  UCA  is  the  cost  accounting  method  for  all  Department  of  Defense  (DoD) 
medical  facilities  it  is  important  to  examine  if  the  cost  of  providing  patient 
care  is  sensitive  to  changes  in  case-complexity.  UCA's  ability  to  reflect 
changes  in  case-complexity  has  future  implications  for  use  as  a  budgeting  tool 
sensitive  to  case-complexity,  as  well  as  for  evaluating  the  efficiency  of  pro¬ 
viding  patient  care  based  on  cost  per  unit  of  output. 

METHODOLOGY 

Sample  Selection 

The  sample  consisted  of  1323  patient  records  sampled  from  16  services  which 
reflected  the  major  inpatient  services  at  WHMC  for  FY  82.  The  only  major 
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service  not  sampled  was  obstetrics.  Unique  military  codes  added  to  the  diagno¬ 
sis  codes  within  this  service  made  sample  selection  difficult. 

The  source  for  the  sample  was  the  Medical  Administrati ve  Management  System 
Revised  (MAMSR).  MAMSR  was  the  automated  patient  registration  system  used  by 
selected  Air  Force  medical  facilities  during  the  study  period.  This  system 
included  individual  patient  information  with  regard  to:  (1)  service  of 
discharge;  (2)  diagnoses  and  procedures;  (3)  length  of  stay;  (4)  beneficiary 
type;  (5)  patient  referral  status;  and  (6)  treating  physician.  Using  historical 
MAMSR  data,  two  stage  proportional  sampling  was  used.  Initially,  a  target 
sample  size  of  1330  cases  was  established.  Sample  size  was  based  on  service 
diagnostic  variability  as  well  as  the  cost  estimate  for  recoding  records. 

In  the  first  stage,  the  number  of  records  to  be  sampled  from  each  of  the 
sixteen  services  was  determined  using  the  following  formula: 

Number  of  Dispositions  in  Service! 

. . .  X  Target  Sample  Size 

Total  Dispositions  of  Study  Services 

Where:  Service!  =  Each  of  the  16  services  analyzed. 

In  the  second  stage  the  number  of  individual  diagnoses  to  be  sampled  from 
each  service  was  determined  using  the  following  formula: 

Number  of  Diagnosisj  in  Service^ 

_ __ _  X  Service!  Sample  Size 

Total  Dispositions  in  Service! 

Where:  Diagnosis-;  -  each  diagnosis  in  Service! 

After  the  number  of  cases  required  for  each  diagnosis  within  a  service  was 
determined,  the  cases  were  randomly  selected  from  each  diagnosis. 

Case-Complexity 

Upon  completion  of  the  sample  selection,  the  1323  patient  records  were 
retrieved.  The  records  were  then  manually  recoded  from  ICD-9  (ICD-9  is 
currently  used  by  DoD)  to  ICD-9-CM  codes.  To  insure  accuracy  of  recoding, 
each  record  was  reviewed  by  a  supervisor.  After  conversion  to  ICD-9-CM 
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codes.  Health  Systems  International  (HSI)  DRG  Grouper  Tape  (August  1983  edition) 
was  used  to  obtain  the  DRG  classification. 

All  but  59  records  were  successfully  grouped  to  DRG.  Failure  to  group  these 
59  records  was  due  to  a  programming  error  in  age  computation  for  patients  under 
one  year  of  age.  Once  age  corrections  were  made  these  remaining  records  were 
manually  grouped  using  the  HSI  Revised  ICD-9-CM  DRG  Manual. 

DRGs  were  then  assigned  complexity  weights  using  Relative  Weighting  Factors 
from  the  Health  Care  Financing  Administration  (HCFA)  1981  survey  of  5853  hospi¬ 
tals  nationwide. *  Each  weight  factor  in  this  survey  reflects  the  individual  DRG 
mean  patient  charge  relative  to  the  mean  charge  for  all  DRGs.  Mean  case- 
complexity  for  each  service  was  determined  by  averaging  the  weight  factors  of 
the  DRGs  within  each  service. 

Facility  Service  Costs 

Cost  data  were  derived  from  the  DoD  Uniform  Chart  of  Accounts  (UCA).  UCA 
is  an  automated  costing  methodology  which  distributes  costs  to  final  accounts 
within  the  hospital.  These  final  accounts  include  clinical  services,  outpatient 
clinics,  dental  services  and  special  programs.  In  UCA,  total  costs  are 
comprised  of  direct  costs  and  indirect  costs.  Direct  costs  are  those  costs, 
such  as  salary  costs  and  certain  supply  costs  that  can  be  directly  assigned  to  a 
final  account.  Indirect  costs  are  assigned  using  a  step-down  purification  pro¬ 
cess  which  distributes  ancillary  and  support  costs  to  the  final  accounts  they 
serve. 

Support  costs  include  the  costs  of  equipment  depreciation,  administrative 
support  overhead  (to  include  base  support,  such  as  police  and  fire  protection, 
etc.)  and  custodial  support  services.  These  costs  are  distributed  to  the 

T./See,  Federal  Register,  Vol.  48,  No.  171,  pp.  39876  to  39886,  September  1, 
1983.  ' .  “ 
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respective  final  accounts  based  on  a  step-down  assignment  factor  specific  to 
that  support  function  (e.g.,  square  footage,  pounds  of  linen  processed,  etc.) 

Ancillary  costs  include  the  costs  of  certain  medical  services  such  as 
laboratory,  radiology,  pharmacy  and  rehabilitative  services.  These  costs  are 
distributed  to  final  accounts  based  on  weighted  procedures  provided  to  the  final 
account.  Weighted  procedures  are  specific  to  the  ancillary  service  (e.q., 
weighted  laboratory  values  and  pharmacy  prescriptions,  etc.).^ 

UCA  costs  were  derived  for  each  of  the  sixteen  services  using  FY  82  UCA  Unit 
Detail  Cost  Reports  and  Computation  Summaries.  The  following  service  mean  costs 
were  derived  by  dividing  respective  annual  costs  by  total  annual  workload:  (1) 
Total  mean  cost  per  patient  day;  (2)  Total  mean  cost  per  disposition;  (3)  Sup¬ 
port  mean  cost  per  patient  day;  (4)  Support  mean  cost  per  disposition;  (5)  An¬ 
cillary  mean  cost  per  patient  day;  and,  (6)  Ancillary  mean  cost  per  disposition. 

ANALYSIS 

Descriptive  Statistics 

Table  1  includes  total,  support  and  ancillary  cost  per  day,  cost  per  dispo¬ 
sition,  and  the  mean  case-complexity  for  each  of  the  16  services  studied. 

Service  mean  case-complexity  at  WHMC  ranged  from  .460  for  Opthalmology  to  2.074 
for  Cardiothoracic  Surgery.  Service  mean  total  cost  per  bed  day  ranged  from 
S143.60  for  Neurology  to  $386.27  for  Cardiology.  Service  mean  total  cost  per 
disposition  ranged  from  $1716.59  for  Neurology  to  $4155.41  for  Dsychiatry. 

Descriptive  statistics  for  the  study  sample  are  presented  in  Table  2.  there 
were  254  unique  DRGs  for  the  study  sample  with  an  mean  case-complexity  score  of 
.9836.  The  study  sample  had  a  mean  age  of  36.98  years,  61%  were  married  and 

2.~7For  more  Information,  see  Department  of  Defense  Uniform  Chart  of  Accounts 
for  fixed  Military  Medical  and  Dental  Treatment  Facilities,  DoD  Pub. 

6010. 10-M,  July  19 79. 


SERVICE  MEAN  CASE-COMPLEXITY  AND 
MEAN  COSTS  FOR  WILFORD  HALL  USAF  MEDICAL  CENTER 
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Table  2 


DESCRIPTIVE  STATISTICS  Of  STUDY  POPULATION 

(N  -  1323) 


Services  Examined 
Unique  DRGs 
Mean  Case-Complexity 
Mean  Age 

I 
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!  Marital  status 
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iis jumpt i ons .  Consequently  all  cost  variables  were  log-transformed  and  correla¬ 
tional  analysis  was  performed  for  both  unlogged  costs  and  log-transformed  costs. 

Correlations  between  service  mean  total,  support  and  ancillary  costs  per 
oed  day  and  mean  service  case-complexity  are  presented  in  Table  3.  Correlations 
between  service  mean  total,  support,  and  ancillary  costs  per  disposition  and  mean 
service  case-complexity  are  presented  in  Table  4. 


Table  3 

CORRELATION  OF  SERVICE  MEAN  CASE-COMPLEXITY 
AND  MEAN  COST  PER  BED  DAY1 


All  Services  Psychiatry  Excluded 

(N  =  16)  (N  =  15) 


Type  of  Cost 

Costs 

Un 1 ogged 

Costs 

Logged 

Costs 

Unlogged 

Costs 

Logged 

Total  Costs 

.04 

.00 

.03 

,00 

Support  Costs 

-.06 

-.15 

-.04 

-.15 

Anc i 1 1 arv  Costs 

.1/ 

.12 

.17 

.13 

1 A 1 1  correlations  are  non-significant. 


There  were  no  significant  correlations  between  service  mean  case-complexity 
and  service  mean  total,  iuppo^t,  and  ancillary  costs  per  bed  day  (see  Table  3;. 
This  lack  of  relationship  persisted  when  costs  were  log-transformed  as  well  as 
when  psychiatry  was  m  I  tided. 

In  contrast  tc  the  lack  of  relationship  between  service  mean  cost  per  bed 

day  ano  case-complexity,  there  were  statistically  significant,  strong 

positive  relationships  bet "o on  service  mean  costs  per  disposition  and  service 

mean  ease-comp  1  ex  i1'/  ( sen  Table  4). 4  The  correlation  between  mean  total 

4. /The  algorithm  used  to  test  statistical  significance  of  correlation  coef 
ficients  can  be  fourth  m  Sredecor ,  6.W.  and  Cochran,  W.G.,  Statistical  Method  -  . 
Ames,  Iowa.  Iowa  State  University  Press,  1980,  p.  185. 


Table  4 


CORRELATION  OF  SERVICE  MEAN  CASE -COMPLEX I TY 
AND  MEAN  COST  PER  DISPOSITION 


All  Services 

Psychiatry  Excluded 

N  = 

16) 

(N  =  15) 

Costs 

Cost  ^ 

Costs 

Costs 

Type  of  Cost 

Un  1  nnno(j 

Logge  i 

Unlogged 

Loggeo 

Total  Costs 

.6! 

.85*** 

.80*** 

Support  Costs 

.12 

.2 

.45 

.45 

Anci 1 1  ary  Costs 

.63** 

.54* 

.65** 

.57*  ! 

...  ....  J 

p  <  .05 

p  <  .01 

***  p  <  .001 

cost  per  disposition  and  mean  case-complexity  was  r  =  .69  and  when  psych iat^ 
was  excluded  the  correlation  increased  to  r  =  .85.  The  correlation  between 
mean  ancillary  cost  per  disposition  and  mean  case-complexity  was  also  strong 
[r  -  .63).  vlhen  psychiatry  was  excluded  and  ancillary  costs  per  disposition 
and  mean  case-complexity  were  again  examined  there  was  negligible  change  in 
the  positive  relationship  (r  =  .65).  Although  service  mean  support  cost  per 
disposition  did  demonstrate  a  mild  positive  relationship  with  mean  cas*-1- 
complexity  (r  =  .12),  this  relationship  was  not  statistica1 ly  significant. 
Lxcljd'nq  psychiatry  did  increase  this  correlation  to  r  -  .45,  but  this 
relationship  remained  non-signi  f  icant .  Log-transforming  cost  factors  dr: 
improve  rorre 1 ations . 

Prediction  of  Service  Mean  Cost  per  Disposition 

Prediction  equations  were  calculated  for  service  mean  total  cost  per  a 
v  fion  esin  i  service  mean  case-complexity.  Ordinary  least  squares  regre-  -  r: 


analysis  was  used  to  derive  prediction  equations.  Since  there  was  no  improve¬ 
ment  in  correlations  when  cost  factors  were  log-transformed,  prediction 
equations  were  performed  only  for  unlogged  costs.  Equations  were  derived  using 
all  services  (Table  5)  and  again  when  psychiatry  was  excluded  (Table  7). 
Included  in  Tables  5  and  7  are  graphical  data  displays  with  calculated  least 
squares  lines.  Residuals  are  plotted  against  predicted  values  for  the  analysis 
using  all  services  (Table  6)  and  again  when  psychiatry  was  excluded  (Tablp  R). 

Table  5 

PREDICTION  OF  SERVICE  MEAN  TOTAL  COST  PER  DISPOSITION 
BY  MEAN  CASE -COMPLEXITY 

All  Services  (N  *  16) 


•  .m  i  1.8#  2  2.8# 


AVERACE.CASE. COMPLEXITY 

Coefficient 

Intercept  1469.57 

Complexity  1205.31 

Overall  F  Ratio  =  12.93  p  1  .01  r ?  -  .48 

When  using  all  services,  48%  of  the  variant  ■  in  service  mean  total  o 
p Pr  disposition  was  explained  by  mean  case-comp'exity.  When  psychiatry 
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Table  7 


PREDICTION  OF  SREVICE  MEAN  TOTAL  COST  PER  DISPOSITION 
BY  MEAN  CASE -COMPLEXITY 

Psychiatry  Excluded  (N  *  15) 


4 m 

c 

0 

s 

T 

P 

E  2MP 

R 

D 

r  imp 

s 

p 

p 

P  .BN  1  l.BP  2  2.  BP 

AVERACE.CASE. COMPLEXITY 


Coefficient 

Intercept  1321.39 

Complexity  1248.99 

Overall  F  Ratio  =  33.10  p  <  .0001  rz  =  .72 

For  each  service  the  actual  mean  length  of  stay  was  computed.  Secondly, 
for  each  of  the  254  ORGs  represented  in  the  study  population,  the  HCFA 
geometric  mean  length  of  stay  was  substituted  for  the  actual  length  of  stay 
for  that  ORG.  Service  mean  length  of  stay  was  then  recomputed.  Service  mean 
length  of  stay  data  are  presented  in  Table  9  along  with  service  mean  case- 
complexity.  Services  have  been  ranked  by  case-complexity  to  assist  review. 

A  paired  t-test  was  performed  to  determine  if  the  difference  between  WHMC 
actual  mean  length  of  stay  and  HCFA  mean  length  of  stay  was  statistically 
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Table  8 

PLOT  OF  PREDICTED  MEAN  COST  PER  DISPOSITION  RESIDUALS 
WITH  PREDICTED  MEAN  COSTS 

Psychiatry  Excluded  (N  =  15) 


PRED  ICTED.VALUES 

Serial  Correlation  of  Residuals  =  -.19 

significant.  As  Table  9  indicates  the  difference  in  length  of  stay  was 
statistically  significant. 

Correlations  were  computed  between  service  mean  case-complexity  when 
using  WHMC  service  actual  mean  length  of  stay  and  when  using  service  HCFA  DRG 
mean  length  of  stay  (see  Table  10).  In  addition,  the  difference  between  WHMC 
and  HCFA  service  mean  length  of  stay  was  correlated  with  service  mean  case- 
complexity.  The  correlation  between  WHMC  actual  length  of  stay  and  case- 
complexity  was  r  =  .57.  Excluding  psychiatry  substantially  increased  this 
correlation  (r  »  .83).  As  anticipated  HCFA  mean  length  of  stay  demonstrated 
a  strong  correlation  (r  =  .81).  The  exclusion  of  psychiatry  did  not  improve 
this  relationship.  When  correlating  the  difference  between  WHMC  and  HCFA 


COMPARISON  OF  WHMC  SERVICE  ACTUAL  MEAN  LENGTH  OF  STAY 
AND  HCFA  ADJUSTED  MEAN  LENGTH  OF  STAY 


Service 

Mean 

Case 

Complexity 

WHMC 

Actual  Mean 

Length  of  Stay 

WHMC 

HCFA  Adjusted  Mean 
Length  of  Stay 

Otorhino¬ 

laryngology 

.460 

3.868 

2.958 

Ophthalmology 

.565 

6.556 

3.590 

Plastic 

Surgery 

.622 

5.889 

3.822 

Nursery 

.678 

5.182 

4.336 

Internal 

Medicine 

.829 

8.217 

6.223 

Pediatrics 

.860 

6.280 

5.123 

Neurology 

.881 

12.395 

7.100 

Gynecology 

.915 

8.753 

6.819 

Psychiatry 

.932 

27.017 

8.473 

General 

Surgery 

.945 

7.066 

6.853 

Urology 

.980 

8.071 

6.879 

Orthopedics 

1.028 

16.868 

7.590 

Neurosurgery 

1.200 

11.982 

9.646 

Oncology 

1.282 

15.465 

8.023 

Cardiology 

1.797 

14.256 

7.912 

Cardiothoracic 

Surgery  2.074 

19.093 

10.121 

Mean 

Difference  =  4.46  Days* 

10.121 


CORRELATION  OF  SERVCE  MEAN  LENGTH  OF  STAY  WITH 
MEAN  CASE-COMPLEXITY 


All  Services 

Psychiatry  Excluded 

(N  =  16) 

(N  =  15) 

WHMC  Service  Mean  Length  of  Stay 
and  WHMC  Service  Complexity 

.57* 

.83*** 

HCFA  DRG  Geometric  Mean  Length  of 
and  WHMC  Service  Complexity 

.81*** 

.84*** 

Difference  between  WHMC  and  HCFA 

Mean  Length  of  Stay 

and  WHMC  Service  Complexity 

.39 

.69** 

*  p  <  .05  **  p  <  .01  ***  p  <  .001 


mean  length  of  stay  and  mean  case-complexity,  a  mild  positive  correlation 
resulted  (r  =  .39),  but  the  correlation  was  not  statistically  significant. 
Excluding  psychiatry  served  to  strengthen  the  observed  relationship  (r  =  .69). 
This  correlation  was  now  statistically  significant.  In  other  words,  as  service 
case-complexity  increased  the  difference  between  WHMC  mean  length  of  stay  and 
HCFA  mean  length  of  stay  also  increased. 

DISCUSSION  AND  IMPLICATIONS 

There  was  no  observed  relationship  between  service  mean  cost  per  bed  day 
and  service  mean  case-complexity.  In  services  with  higher  complexity,  the 
patient  days  in  those  services  distributed  costs  over  longer  periods  of 
hospitalization.  In  other  words,  the  major  cost  of  hospital izaton  typically 
occurred  in  the  first  few  days  of  hospitalization  and  was  progressively  diluted. 

When  psychiatry  was  excluded,  both  service  total  and  ancillary  mean  cost 
per  disposition  demonstrated  a  strong  positive  correlation  with  service  mean 


case-complexity.  Service  mean  cost  per  disposition  more  accurately  reflected  the 
major  costs  of  hospitalization,  since  the  dilution  affect  of  a  longer  period  of 
hospitalization  was  removed.  The  lack  of  relationship  demonstrated  between 
service  mean  support  cost  and  mean  case-complexity  was  not  unexpected. 

Consumption  of  support  costs  by  a  service  (e.g.,  pounds  of  laundry  used,  con¬ 
sumption  of  utilities,  etc.)  tends  to  be  influenced  more  by  overall  uitilization 
rather  than  the  type  of  patient  treated. 

The  actual  service  mean  length  of  stay  for  psychiatry  at  WHMC  departed 
markedly  from  the  HCFA  mean  length  of  stay.  Patients  participating  in  the 
Alcohol  Rehabilitation  Program  at  WHMC  must  remain  as  inpatients  for  specified 
lengths  of  stay  as  required  by  OoD  regulations.  For  example,  for  DRG  436, 

Alcohol  and  Substance  Induced  Organic  Mental  Syndrome,  and  DRG  438,  Alcohol 
Dependence,  the  HCFA  mean  lengths  of  stay  are  8.1  and  6.9  days  respectively.  For 
WHMC,  the  length  of  stay  for  DRG  436  was  36.5  days  and  for  DRG  438  the  length  of 
stay  was  25.3  days.  Due  to  the  prescribed  lengths  of  stay  for  the  above 
program,  cost  per  disposition  was  markedly  affected,  decreasing  the  number  of 
dispositions  for  this  service.  This  resulted  in  an  elevated  mean  cost  per  dis¬ 
position.  Study  data  suggest  that  when  using  a  case-complexity  system  based 
upon  the  experience  of  civilian  hospitals,  consideration  must  be  given  to  unique 
DoD  requirements  affecting  treatment  patterns  in  psychiatric  services. 

The  positive  correlation  between  the  difference  of  WHMC  and  HCFA  service 
mean  length  of  stay  and  mean  case-complexity  may  have  been  accounted  for  by  such 
factors  as  the  teaching  mission  and  military  discharge  processing  requirements 
of  WHMC.  This  suggests  that  further  analysis  and  possible  adjustment  for  fac¬ 
tors  such  as  military  teaching  requirements  would  be  necessary  prior  to  imple¬ 
menting  a  case-complexity  based  cost  accounting  system. 

Study  findings  indicated  that  service  mean  cost  per  disposition  more  accura¬ 
tely  reflected  the  resources  required  to  provide  medical  care  based  upon  case 


complexity,  suggesting  that  allocating  resources  to  military  medical  treatment 
facilities  under  a  case-complexity  system  using  mean  cost  per  bed  day  would  be 
inappropriate  and  may  be  counter  productive.  In  other  words,  if  medical  treat¬ 
ment  facilities  were  allocated  resources  based  on  case-complexity,  allocation 
of  resources  based  on  disposition  cost  would  stimulate  efficiency;  whereas, 
resource  allocation  based  on  cost  per  bed  day  could  have  the  opposite  effect. 
This  relationship  is  illustrated  in  Table  11.  These  two  hypothetical  hospitals 

Table  11 


COMPARISON  OF  DISPOSITON  AND  BED  DAY  COSTS 
FOR  TWO  COMPARABLE  HOSPITALS 


Hospital  A 

Aggregate  Case-Complexity  Index:  .9836 

Total  Expense:  $100,000.00 

Total  Dispositions:  60 

Total  Bed  Days:  600 

Mean  Length  of  Stay:  10  days 

Mean  Cost  per  Disposition:  $1,666.66 

Mean  Cost  per  Bed  Day:  $166.66 


Hospital  B 

Aggregate  Case-Complexity  Index:  ,983i 

Total  Expenses:  $100,000.00 

Total  Dispositions:  50 

Total  Bed  Days:  750 

Mean  Length  of  stay:  15  days 

Mean  Cost  per  Disposition:  $2,000.00 

Mean  Cost  per  Bed  Day;  $133.33 


have  identical  mean  case-complexity,  have  the  same  total  expenses,  but  Hospital 
A  treated  more  patients  (i.e.,  a  greater  number  of  dispositions).  However,  the 
average  length  of  stay  in  Hospital  B  is  50%  higher  than  in  Hospital  A  (15  days 
as  compared  to  10  days).  Although  the  cost  per  disposition  is  less  in  Hospital 
A  ($1,666.66  as  compared  to  $2,000.00  in  Hospital  B),  Hospital  B  has  a  lower 
cost  per  patient  day  ($133.33  as  compared  to  $166.66  in  Hospital  A),  as  well 


as  a  greater  number  of  patient  days.  Current  allocation  of  inpatient  resources 
within  DoD  is  predicted  upon  the  number  of  bed  days.  In  addition,  one  perfor¬ 
mance  measure  of  military  medical  treatment  facilities  is  cost  per  patient  day. 
Although  Hospital  A  is  more  efficient  from  the  perspective  of  lost  productivity, 
(i.e.,  absence  from  the  work  place  as  a  result  of  the  shorter  length  of  stay), 
under  current  methods  of  evaluation  Hospital  B  would  be  considered  more  effi¬ 
cient. 

UCA  costs  were  sensitive  to  service  differences  in  case-complexity  when 
the  cost  per  disposition  was  used.  The  results  of  this  study  indicated  that  in 
the  future,  if  military  medical  treatment  facilities  were  allocated  resources 
based  on  cost  per  disposition  rather  than  cost  per  bed  day,  this  method  of 
resource  allocation  would  more  appropriately  reflect  the  case-complexity  of  the 
facility.  If  cost  per  disposition  rather  than  cost  per  bed  day  was  used  as  a 
measure  of  productivity  there  would  be  greater  incentives  for  cost  minimization. 


PRODUCTIVITY  MEASUREMENT  AT  THE  FORT  ORD  MEDDAC 

LTC  C.H.  Moore,  MSC,  USA 


A  narrative  of  LTC  Moore's  presentation  was  not  available  for  publication. 
However,  we  wanted  to  include  at  least  an  outline  of  his  very  fine  and  well- 
received  presentation  in  these  proceedings. 


SEVEN  STEPS  TO 

WORKLOAD  AND  PRODUCTIVITY  MANAGEMENT 
AT  MEDDAC  FORT  ORD 


STEP  1:  DETERMINE  WORK  TO  BE  DONE 

Agreement  between  workcenter  and  top  management  on  the  amount  of  work 
to  be  done  (e.g.,  clinic  visits,  pages  typed,  issues  processed). 

STEP  2:  ESTABLISH  PRODUCTIVITY  OBJECTIVES 

Agreement  on  the  number  of  personnel  to  do  the  work,  establishing  a 
workcenter  productivity  bench  mark. 

STEP  3:  RECORD  WORK  AND  TIME 

Accurately  and  honestly  record  work  accomplished. 

STEP  4:  MEASURE  WORK  AND  PRODUCTIVITY 

Measure  actual  work  done  against  expected  work 

and 

Measure  actual  productivity  against  expected  productivity. 

STEP  5:  ANALYZE  RESULTS 

WHEN  ACTUAL  WORK  ACCOMPLISHED  AND,  ACTUAL  PRODUCTIVITY  THEN  WE  ARE  LIKELY 
IS  IS  TO 

same  as  expected  same  as  expected  move  on  to  next 

activity. 

higher  than  expected  reward;  look  for  more 

work  or  consider  re¬ 
ducing  authorizations. 

question  work  count 
and  manpower  require¬ 
ments;  criticize  if 
indicated. 


lower  than  expected 
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STEP  5:  (continued) 

WHEN  ACTUAL  WORK  ACCOMPLISHED  AND,  ACTUAL  PRODUCTIVITY  THEN  WE  ARE  LIKELY 
IS_  IS  TO 

greater  than  expected  same  as  expected  question  work  count 

and  investigate  any 
policy  and  procedural 
changes  in  workcenter. 

higher  than  expected  reward;  increase  re¬ 

sources  . 

lower  than  expected  question  work  count 

and  investigate  any 
policy  and  procedural 
changes  in  workcenter. 

less  than  expected  same  as  expected  conduct  on-site  audit 

of  reported  work 
hours;  look  for  more 
work  or  consider  re¬ 
ducing  authorizations. 

higher  than  expected  conduct  on-site  audit 

of  both  reported  work 
hours  and  workload. 

lower  than  expected  criticize  or  adjust 

authorizations. 


STEP  6:  TAKE  ACTION 

Take  management  action  if  necessary. 

Take  leadership  action: 

--  stimulate  people 
--  remove  obstacles 
--  improve  procedures  and  policies 
--  discipline 

Adjust  resources 

--  manpower  allocations 

--  fund  hirelag  and  temporary  positions 

--  buy  equipment 

--  alter  facilities 

--  redesignate  space 

STEP  7:  FOLLOW  UP 

Review  problem  workcenters  monthly  to  determine  if  the  corrective 
management  actions  selected  are  producing  expected  results. 
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WHY  THIS  SYSTEM  WORKS 

0  COMMAND  EMPHASIS 

o  DC A  AND  DCCS  ARE  RESULTS  ORIENTED 

o  THE  WORKCENTER  MANAGER  IS  PERSONALLY  INVOLVED 
IN  THE  PROCESS 

o  WE  TAKE  CORRECTIVE  ACTION  PROMPTLY 
o  WE  REALLOCATE  RESOURCES  QUICKLY 
o  WE  FOLLOW  UP 

PROVIDING  CARE 

1.  KNOW  POPULATION  SUPPORTED  AND  BASIC  DEMOGRAPHIC  CHARACTER  1ST  rr 

2.  EPIDEMIOLOGICAL  CHARACTERISTICS  (PRESENT  AND  PAST  UTILIZATION) 

3.  PROJECT  (ESTIMATE)  AN  APPROPRIATE  LEVEL  OF  UTILIZATION 

4.  PROVIDE  EFFECTIVE  AND  EFFICIENT  QUALITY  HEALTH  CARE  WITHIN  AVAILABLE 
RESOURCES 

5.  MANAGE  WELL 

BE  EFFECTIVE 
BE  EFFICIENT 

--  UNIT  COSTS  LOW 
--  LABOR  PRODUCTIVITY  HIGH 

PROPOSITION 

THAT  WITH  THE  NECESSARY  INCENTIVES,  RESOURCE  MANAGEMENT  FLEXIBILITIES,  AND 
AN  INTEGRATED  COST  AND  UTILIZATION  DATA  BASE,  THE  MTF  CAN  FOCUS  ON  A  "TOTAL 
SYSTEM"  APPROACH  TO  EFFECTIVELY  AND  EFFICIENTLY  SATISFY  DOD  HEALTH  CARE  DELIVERY 


REQUIREMENTS. 


PROPOSITION  (continued) 


MTF  CONCERN 


OTHER 


C  H  A  M  P  U  S 


BEFORE 


AFTER 


THE  PREDOMINANTLY  "INTERNAL  INSTITUTIONAL"  APPROACH  WE  USE  TODAY  WILL 


DISAPPEAR. 


WHAT  IS  NECESSARY? 


INCENTIVE  AT  MTF 

The  point  of  health  care  delivery,  mobilization 
responding  and  disaster  planning. 

RESOURCE  FLEXIBILITY 

The  commander's  capability  to  capitalize  on 
opportunities  and  correct  problems  fast. 

INTEGRATED  COST  AND  UTILIZATION  DATA  BASE 


Only  way  the  MTF  can  view  DOD  financed  direct 
care,  CHAMPUS  and  other  health  care  collectively 


INCENTIVES 


1.  Make  MTFs  responsible  for  full  range  of  DOD  costs  to  induce  interest  in  all 
DOD  funded  care. 

2.  Allow  MTFs  to  participate  with  the  patient  in  choosing  whether  care  will 
be  provided  by  direct  care,  supplemental  care,  CHAMPUS,  or  other  federal 
facilities. 

3.  Encourage  prospective  thinking  at  MTF  level  by  asking  commanders: 

a.  To  target  their  own  system  utilization  indicies  to  develop  a  basis  for 
determining  their  own  effectiveness. 

b.  To  obtain  internal  involvement  and  agreement  among  department  chiefs  on 
productivity  and  cost  goals  so  there  is  a  basis  to  measure  their  own 
efficiency.  Then  alter  resource  request  procedures  to  allow  commanders 
to  prospectively  state  clinical  initiatives. 


RESOURCE  FLEXIBILITY 


1.  Raise  floor  on  investment  equipment  purchases  to  allow  more  O&M  procurement. 
Urge  Defense  and  Congress  to  provide  for  co-mingling  of  CHAMPUS  and  Military 
Department  O&M  funds.  This  allows  commanders  to  make  advantageous  traoe- 

of  fs . 

2.  Support  reemerging  congressional  interest  in  making  Operations  and  Mainte¬ 
nance  two  year  appropriations.  This  allows  commanders  to  attain  their 
objectives  in  a  continuous  process,  rather  than  executing  disconnected 
annual  budget  segments. 

3.  Integrate  processes  by  which  MTFs  fill  military  personnel  requirements  and 
prepare  operating  budgets.  This  allows  MTFs  to  advance  prospective 
initiatives  which  will  be  fully  supported  or  fully  nixed.  No  more  imbalance 
between  military  personnel  availability  and  funds  to  accomplish  programs. 

4.  Urge  Congress  to  remove  civilian  end  strength  limitations.  Failing  that, 
civilian  positions  funded  on  reimbursable  basis,  i.e.,  from  CHAMPUS  funds, 
not  to  be  counted  against  end  strength  limitations. 


INTEGRATED  COST  AND  UTILIZATION  (WORKLOAD)  DATA  BASE 


1.  Use  data  already  available.  MTFs  have  enough  data  collection  activities 
now. 

2.  Insure  collateral  systems  can  understand  each  other  without  translators, 
e.g.,  CHAMPUS  inpatient  coding  and  Army  IPDS  coding.  Only  way  MTFs  can 
"see"  their  catchment  areas. 
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3.  Management  reports  derived  from  this  data  base  must  be  clear,  simple,  and 
understood  by  both  providers  and  lower  level  supervisors  -  we're  trying  to 
provide  timely  information  to  clinical  decision  makers,  not  justify  the  need 
for  management  and  quantitative  analysts. 


SUMMARY 

1.  MTFs  can  do  reasonably  well  at  managing  internal  productivity  with  today's 
data.  Simple  techniques  are  effective. 

2.  The  proposition  is  that  with  incentives,  more  flexibility  in  resource  manage¬ 
ment,  and  a  fully  integrated  cost  and  utilization  data  base,  commanders  can 
focus  their  attention  on  all  DOD  funded  health  care  within  their  catchment 
area. 

3.  If  this  is  so,  then  we  can  expand  our  interest  from  institutional  MTF 
management  and  productivity,  to  system  management  and  system  productivity 
within  the  catchment  area. 

4.  We  need  concensus  among  the  military  departments  that  this  is  worth  doing 
and  both  OSD  and  Congressional  support  to  accomplish  it. 


DEVELOPMENT  OF  A  NONPATIENT  CARE  MODEL  FOR  THE 
PERFORMANCE  MEASUREMENT  STUDY 


CPT(P)  James  M.  King,  MSC,  USA 
MAJ  Donald  E.  O'Brien,  MSC,  USA 
A.D.  Mangelsdorff ,  Ph.D. 

Presently,  measurement  of  the  work  performed  by  the  military  health  care 
system  is  restricted  to  those  activities  that  can  be  directly  related  to  patient 
care  through  the  Medical  Care  Composite  Unit.  This  creates  a  problem  when  one 
attempts  to  measure  the  actual  work  performed  by  providers  in  this  setting, 
because  individual  provider  data  are  not  now  collected  in  a  fashion  which  makes 
them  either  easily  retrievable  or  compatible  with  other  data  sources.  In  addi¬ 
tion  many  of  the  activities  performed  by  these  providers  are  mandated  by  their 
roles  as  health  care  providers  in  a  military  setting,  but  do  not  fall  under  the 
rubric  of  patient  care  activities.  Thus,  we  feel  that  a  number  of  activities 
can  be  identified  which  are  critical  to  the  adequate  performance  of  the  health 
care  provider  role  in  the  military  environment,  but  which  do  not  involve  patient 
care. 

Currently,  the  Army  accounts  for  time  spent  by  medical  personnel  in  a 
variety  of  other  than  patient  care  activites  through  an  11%  nonavailability  fac¬ 
tor.  This  factor,  which  is  discussed  in  the  Staffing  Guide  for  U.S.  Army 
Medical  Department  Activities  (1974),  combines  such  diverse  activities  as  PCS 
processing,  leave,  and  sick  time  with  other  than  patient  care  activities  for  all 
grades  and  occupational  specialties.  Regardless  of  a  person's  grade,  specialty, 
job  title,  or  SSI,  this  nonavailability  factor  is  constant.  On  the  face  of  it, 
such  an  approach  would  seem  to  oversimplify  the  situation.  Indeed,  studies  by 
Parker  and  Mayotte  (1979),  Alexander  and  Mangelsdorff  (1980),  Misener  and  Frelin 
(1983),  and  King,  O'Brien,  and  Mangelsdorff  (1983a  and  1983b)  have  shown  that 
time  spent  at  other  than  patient  care  activities  varies  by  grade,  facility,  and 
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facility  type.  Other  researchers  (Langwell  and  Deane,  1980;  Spaulding,  Shelly, 
Domine,  Martin,  and  St.  Claire,  1983)  have  found  that  data  systems  to  effi¬ 
ciently  track  this  sort  of  activity  are  not  yet  available  in  the  civilian  sec¬ 
tor. 

We  decided  to  focus  this  portion  of  the  Performance  Measurement  Study  on 
military  unique  activites,  committees,  clinical  investigation,  and  training/ 
teaching,  as  these  areas  have  received  the  least  attention  under  existing 
workload  accounting  systems  (DOD  Pam.  6010. 10-M,  1979;  DOD  Pam.  6010. 11-M, 

1982;  DA  Pam.  40-XX,  Draft).  The  Uniform  Chart  of  Accounts  Personnel 
Utilization  System  (UCAPERS),  which  is  currently  undergoing  implementation, 
appears  to  be  a  substantial  improvement  over  the  existing  systems,  but  it 
still  relies  upon  labor-intensive  manual  keying  for  data  entry  (Federal  Data 
Systems  Corporation,  1983).  The  clinical  investigation  and  training/teaching 
areas  have  received  explicit  recognition  in  the  Veterans  Administration  model, 
as  described  by  Thomas,  Berki,  Wyszewianski ,  and  Ashcraft  (1983),  but  the  com¬ 
mittee  function  is  almost  always  lumped  into  overhead  costs  in  all  systems.  The 
military  unique  activities  are  not  captured  in  a  fashion  which  is  compatible 
with  other  aspects  of  the  Performance  Measurement  Study.  To  the  maximum  extent 
possible,  we  propose  that  our  study  be  compatible  with  Uniform  Chart  of  Accounts 
(UCA),  Uniform  Staffing  Methodologies  (USM),  and  UCAPERS  in  coding,  definitions 
of  terms,  and  with  the  Ambulatory  Care  Data  Base  (ACDB)  portion  of  the  Perfor¬ 
mance  Measurement  Study  (PMS)  in  terms  of  test  sites.  These  data  will  be  input 
and  manipulated  either  through  the  ACDB  hardware  using  the  optical  mark  reader 
technology,  or  within  the  context  of  UCAPERS.  The  data  element  lists  and  forms 
designs  have  been,  and  will  continue  to  be,  developed  in  consultation  with  clin¬ 
ical  and  administrative  professionals  working  in  medical  treatment  facilities. 

We  are  proposing  a  two  level  system  of  data  capture.  The  first  level  will 
involve  the  capture  of  data  related  to  group  activities.  This  level  will 
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i da.ni i f  y  the  numbers  of  participants  by  categories,  and  will  identify  the  type 
of  activity  accompl l shed.  The  type  of  activity  will  be  explicitly  related  to  a 
UCA/U5M  fourth  level  code  as  described  in  the  Health  Services  Command  UCA/U3M 
Standard  Fourth  Level  Coding  F_Y  84  manual  (  1983).  The  following  figures  are 
intended  to  present  desired  data  elements,  rather  than  final  forms  designs. 
Figure  1  contains  the  data  elements  related  to  the  clinical  investigation 
mission  and  to  the  committee  function.  Figure  2  contains  the  data  elements 
related  to  the  military  training  and  readiness  mission  of  a  medical  treatment 
facility.  Figure  3  displays  a  listing  of  those  elements  needed  to  account  for 
the  time  spent  in  nursing  education  and  training.  Figure  4  shows  the  data  ele¬ 
ments  which  might  be  captured  on  the  types  of  personnel  involved  in  a  particular 
activity.  These  data  would  be  used  in  conjunction  with  the  data  elements 
displayed  in  Figures  1  and  2.  Figure  5  is  an  example  of  the  data  which  might  be 
collected  on  rostered  additional  duties. 

The  second  level  of  data  capture  will  be  keyed  to  specific  individuals.  For 
the  purposes  of  this  study,  data  would  be  collected  from  individuals  who  have 
provider  codes  for  the  Ambulatory  Care  Data  Base  portion  of  the  Performance 
Measurement  Study.  They  will  be  asked  to  indicate,  on  a  periodic  basis,  the 
amount  of  their  time  spent  in  specific  activities,  each  of  which  is  related  to  a 
specific  dCA/USM  fourth  level  code.  Examples  of  the  types  of  data  elements 
wr'icri  are  required  are  shown  in  Figure  6.  It  may  prove  feasible  to  collect 
these  data  through  more  frequent  iJSM  or  UCAPERS  Clinician  Surveys,  or  through 
tne  :Jr AD E K S  Clinician  Utilization  Worksheet,  if  appropriate  changes  can  be 
r.  j.-  poraled  into  tnese  systems  (DOJ  Pam.  6Ui0.11-M,  1982;  DA  Pam.  40-Al,  Draft; 
r  e .1  e r a  •  uata  Systems  Ccrpo” at ior  ,  1983  )  at  the  various  study  sites  for  the 
Ami). 1 1  ato’-y  Care  Data  Base  portion  of  the  Performance  Measurement  Study.  This 
1  mu  age  is  c  ^uc’al  it  data  from  all  portions  of  the  Performance  Measurement 
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Study  are  to  be  available  on  a  reasonable  sample  of  medical  treatment  facili¬ 
ties. 

Using  tnese  sorts  of  inputs,  we  hope  to  develop  a  provider  utilization 
model,  as  outlined  in  Figure  7.  We  will  account  for  provider  time  as  follows: 
Non-avai lable  time  (NAT)  will  be  defined  as  leave,  sick,  and  permanent  change  of 
station  travel;  Available  time  (AT)  will  be  determined  by  totaling  patient  care 
time  (PCT),  consisting  of  inpatient  and  outpatient  care,  and  nonpatient  care 
time  (NPCT),  consisting  of  tne  readiness,  clinical  investigation,  committees, 
teaching/training,  and  other  activities  outlined  above.  Theoretical  total  time 
will  be  defined  as  the  sum  of  AT  and  NAT.  We  hope  to  use  the  providers  PCT  pro¬ 
ductivity,  as  documented  by  the  other  sections  of  the  Performance  Measurement 
Study,  to  develop  a  weighting  system  for  NPCT  activities  which  will  allow  us  to 
attach  explicit  values  to  these  activities. 

The  present  approach  has  the  advantage  of  directly  acknowledging  the  value 
of  these  various  NPCT  activities,  to  include  readiness,  required  training,  addi¬ 
tional  duties,  physical  training,  clinical  investigation,  and  teaching/training. 
It  recognizes  that  they  are  an  integral  part  of  the  mission  of  the  military 
medical  treatment  facility,  and  it  gives  explicit  credit  for  their  accomplish¬ 
ment.  Our  approach  assumes  that  time  spent  in  NPCT  activities  is  as  productive, 
in  the  context  of  the  Military  Health  Care  System,  as  time  spent  in  PCT  activi¬ 
ties.  It  is  our  intention  to  incorporate  indices  of  the  quality  of  the  NPCT 
activities  into  our  measures  as  they  become  available.  The  present  concept  is 
in  agreement  with  the  efficiency  -  efficacy  model  of  performance  measurement 
discussed  by  LTC  Coventry  elsewhere  in  these  proceedings.  As  a  bi-product  of 
cur  study,  we  believe  that  the  outputs  of  our  data  collection  process,  using  the 
optical  mark  reader  method  of  data  input,  will  enable  the  facilities  to  produce 
the  currently  required  input  for  UCA,  USM,  and  UCAPERS  in  a  more  cost  effective 
and  timely  manner  than  would  be  possible  with  manual  keying  of  •'>put.  In  the 


event  that  we  work  through  UCAPERS,  we  will  avoid  any  redundancy  of  data  collec¬ 
tion  efforts  within  the  facilities.  These  data,  in  conjunction  with  the  data 
collection  and  processing  methodologies  proposed,  will  make  it  possible  for  the 
facilities  themselves  to  produce  reports  which  will  enable  all  levels  of  manage¬ 
ment  within  the  AMEDD  to  gain  a  more  complete  understanding  of  the  demands 
placed  upon  their  facilities  by  NPCT  missions,  to  prioritize  these  NPCT  missions 
in  relation  to  other  facility  missions,  and  to  document  the  outputs  generated  to 
satisify  these  mission  demands.  These  capabilities  are  not  now  fully  realized 
in  any  existing  or  proposed  AMEDD  or  DOD  data  system. 
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!  PRODUCTIVITY  MEASUREMENT  AT  US  GENERAL  LEONARD  WOOD 
ARMY  COMM UN  , r  HOSPITAL 

M*«,;  .j*;. An  Abshire,  MSC,  USA 

w  J  ic  :  i  ■/  i  ~y  measurement  at  the  General  Leonard  Wood  /*my  Community  h'ospit... 
o-  . '  :  a  t  •>'  reporting  the  accomplishments  of  the  physician?  is  a  relative' 

.  $s  ;c.  Within  the  1  e > t  nine  months  there  has  been  ?  concerted  effort  at  rr 
hospital  1  no a?  ure  r^odoctivi ty.  Why  do  we  want  to  measure  productivity?  Our 
.TiCing  in  this  effort  has  been  the  goal  of  MAXIMIZING  ACCESS  OF 

PAT  i.TNTJ  :‘J  QUALITY  CAPE.  We  already  do  this  in  many  ways.  We  have  Central 
Eopointmonr  Syc tern:  (CrS),  increased  telephone  ’’nes,  call  monitors,  and  mini- 
-  .  n-.powe'  .  '  ,s.  We  monitor  no-show  rates,  *•  u  1  d  Community  Health  Care  Board 

meetings,  and  >  measure  tne  productivity  of  providers.  Quality  and  Quantity-- 
pnw  do  •*,.  rr,,  -  iit,'.  ze  *ne:e  two  pieces  of  a  very  large  health  care  puzzle  which 
see.  to  i  f  c /ranging  U  in-  present  t  A  «  we  have  what  we  .  ee  ■  is  a  very  good 
Quality  sc  ranee  Program  at  • n:>  hospital.  What  we  hope  to  achieve  is  to  devel 
r a.i  accurate  and  Mmely  measure  of  productivity  that  will  allow  our  managers 
to  detsrir if  the  tradeoffs  for  quality  are  acceptable  in  rel  ation  to  the 
?  va  '  nbi  1  i r  y  of  z  *e ;  or  i  f ,  m  fact,  w  are  putting  so  much  effort,  into  qua!  it 
care  T.m.  - r  >:«  ar.  dr* eat  ing  ourselves  (and  potential  iy  reducing  our  resources  j  U 
a-nviri  ,-r  •  !  a .  ing  fl-e  cr'an.  ;r.\  of  care  cr  tun  many  heref  iciaru-s  for  too  1  ~n  ; 

PHYSIC  ;  h  rPt.'DUCTIV K Y  REPORT 

W 1 .  i  '■  -■■■“  1  .•  ?  v-.-'  .r  v  I  to  te-i’n  \o  a  hires  s  to*  product  i  <•  i  ty  ;;  ’•  e  *  .  - 

■■  e.izz  ■'  ,  ma'-ice  it  !n?r  i;-.  y  .  jdurt  '  ’  t  >  .  e ;  ..  ■ 

r  ad  •'  ••  br  c.  '  ■:  •  c»r  .•*’  ' '  •  /  ..  *  »  opr sellint;  jv.  g,  .  omp  ■  isheo  'r  >  r  ■■ 

i  )  ^  ■■  *  t  r.  -ff.  -i1.  w  ■-  .y  -  i'  '■  :  p  'i  t  pt  on<T 

•p  ;  1  ,  ■  r  !  >  !  •  -  \  r  •  •_,••••  ’  h  ,  '('lid  provide  .  IP'-  .1  *  ;.0  ’ 

;  .  j .  -  -  •  tPt  re  ■, :  .■  p  .  a ::  t  f  >  •  •  ■.  -  *■  r  \  r  o ,  i  <■  ;  or  - 1  w. 
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finally  arrived  at  what  we  call  the  Physicians'  Productivity  Report.  In  a  way 
this  title  may  be  misinterpreted  to  be  a  measure  of  the  productivity  of  each 
individual  physician.  However,  it  does  not  directly  measure  individual  produc¬ 
tivity  but  measures  productivity  on  a  department  or  clinic  basis.  This  report 
addresses  only  physicians  and  nurse  clinicians  who  produce  MCCU  workload. 

The  report  is  produced  by  the  second  Friday  of  each  month  utilizing  the  data 
from  the  previous  month.  The  report  is  presented  to  the  Utilization  Review 
Committee  and  a  copy  provided  each  department  and  clinic  chief.  The  report  is 
produced  manually  and  once  all  the  input  data  is  collected  it  takes  approxi¬ 
mately  one  manday  to  complete  the  report.  Input  data  for  the  report  comes  from 
the  following  sources: 

a.  MED  302  Report 

b.  Manpower  Availability  Report 

c.  Clinician  Surveys 

a.  DA  Pam  570-557  (Staffing  Guide  for  Army  Medical  Department  Activities) 

The  productivity  report  is  presented  in  two  parts,  the  first  (Figure  1) 
being  the  breakout  of  hours  spent  in  each  clinic  area  as  reported  on  the  clini¬ 
cian  surveys.  The  total  hours  are  reported  on  the  Manpower  Availability  Report. 
The  percentage  reported  on  the  clinician  surveys  are  applied  to  the  total  hours 
and  the  hours  for  each  clinic  are  reflected  on  the  report  for  each  physician. 

The  hours  in  each  clinic  area  are  totaled.  The  next  item  of  information  pre¬ 
sented  is  the  clinic  visits  associated  with  physicians  in  each  clinic  area,  and 
average  daily  beds  occupied.  The  clinic  visits  and  occupied  beds  come  from  the 
MtD  302  Report.  The  total  hours  in  each  clinic  area  are  divided  into  the  clinic 
visits  and  visits  per  hour  are  reported.  The  staffing  guide  criteria  (DA  Pam 
570-557)  for  that  clinic  is  reflected  on  the  report.  And  finally  an  analysis  of 
the  month's  productivity,  or  access  to  quality  care,  is  compared  to  the  staffing 


guide. 


c  uo  a>\  or  oo 


VO  |  Ch  VO 

ro  o  kj-' 


L. 

0) 

3 

>  >- 

in 

flO  D 

0>  h- 

c 

r— 

<TJ 

VO 

00 

»r- 

m  >> 

• 

O 

>»  <o 

r— 

•r* 

«0  TO 

C 

T3 

ir» 

• 

»— 

CVJ 

c 

a 

>■> 

m  a; 

•r" 

<v 

3  C 

u  • 

X 

u  o 

•r-  (/) 

4-> 

to  o 

m  "U 

O  — 1 

££ 

on 

•  • 

CL 

L.  L. 

TO 

tn 

o  o 

1-  OJ 

c 

O)  -r- 

#0 

CL  CL 

3 

u 

(/)  O 

•r— 

00  4->  o 

«/) 

•1  •«-  o 

>T 

in 

X 

.  O 

CL 

vo  >  ro 

• 

ai 

•—1  ox 

X 

o  o 

4-J 

CO  «3 

a> 

>> 

•  • 

ii 

X 

in  in 

i- 

4-»  TO 

o  cu 

TJ 

CL 

C.  CL 

& 

in  x 

4-> 

X 

to  c 

in 

>  -o 

o 

*3  to 

•r- 

a> 

cc 

.  CO  f- 

C 

u  •»- 

<r 

r-  O 

c. 

**-  X 

•  1  *C  -r- 

3 

C  3 

CO 

m 

4- 

f-  o 

> 

o  >> 

*—  u 

< 

•*-  X 

in 

u  o 

^  ro  cvj 
m  o  «s-  *r 


m 

Q)  r— 
T3  • 
3  C\J 


T5TJ  t3  • 
0)  0J  OJ  “C 
•»«-  •«-  -r-  <L 
4-4-4-  C 

o> 

4J  *J  P  •<- 
i/>  (✓>  in  iS 
3  3  3  in 
•r-j  p-5  *r“>  fO 


»/)  l/»  lO  VI 

c  c  c  c 

•C  IO 


o  o 

vo  ir>  csj 


CO 

>  or 
c  o 
o 


u  u  u  o 

•r  -r*  •»*  *r* 
VI  VI  0  VI 

>*>,>>  >^ 
x  x  x  x 
a  a  a  a 

n  co  ^  n 


< 

o  >-  <  z 

— >  00  UJ  oc.  oo 

oo  rj  z  uj  ai 

>-  o  ©  ;»  — < 

X  «  O  UJ  UJ 

a.  ©  —I  O  3 


<v 

cn 

•  • 

*3 

1/) 

4-> 

►-H 

C 

IS) 

0J 

>- 

u 

—1 

L. 

c 

0) 

z 

CL 

c 

As  you  can  see  in  the  Medical  Clinic  we  have  maximized  productivity  as 
measured  by  the  staffing  standard.  But  we  have  not  really  determined  if  each 
provider  is  achieving  his  potential  and  providing  maximum  access  to  quality  care 
or  if  some  other  provider  carrying  a  greater  amount  of  the  workload. 

The  second  part  of  the  report  is  a  graph  which  reflects  the  trends  of  the 
past  months  (Figure  2).  As  you  can  see,  there  has  generally  been  an  increase  in 
productivity.  After  close  scrutiny  of  the  report  by  the  department  or  clinic 
chief  and  armed  with  the  knowledge  of  the  capabilities  of  his  physicians,  he  can 
determine  who  is  or  is  not  being  fully  productive.  Figures  3  through  8  provide 
examples  of  this  analysis  for  three  additional  clinics. 

Since  we  initiated  this  report  the  following  positive  aspects  have  been 
realized: 

a.  Department  Chiefs  have  up-to-date  status  of  department  productivity 
which  they  can  use  to  evaluate  physician  capabilities. 

b.  Informs  department  chiefs  of  impact  of  workload  (productivity)  on 
staffing. 

c.  Provides  a  mechanism  for  Cdr/DCCS  to  identify  problems  or  to  shift 
resources . 

d.  Provides  a  mechanism  to  generate  peer  pressure. 

e.  Identifies  workload  trends. 

f.  Provides  backup  information  for  manpower  surveys. 

g.  Provides  a  mechanism  to  identify  need  to  update  clinician  surveys. 

The  following  actions  have  been  taken  as  a  result  of  the  information  pro- 

vi ded : 

a.  Productivity  has  increased  and  we  have  provided  better  access  of 
patients  to  quality  care. 

b.  Used  in  MEDOAC  mini-manpower  surveys  we  shift  resources  to  improve  sup¬ 
port  and  patient  care. 

c.  Problem  areas  in  certain  clinics  have  been  identified  for  management 
resolution . 
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4  physicians  assigned. 
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Or.  Crawford  -  30  days  maternity  leave 
Dr.  Nelson  -  13  days  leave. 

1.0  MM  worked  in  areas  other  than  ACC. 
1 .6  MM  was  leave. 


ACC  CLINIC 


Shortcomings  of  the  report  are: 

a.  Does  not  break  out  workload  by  physician. 

b.  Concentrates  only  on  MCCU  producing  physicians. 

c.  Efforts  of  support  physicians,  i.e.,  Radiologist,  Pathologist,  PM  are 
not  reflected. 

OUTPATIENT  PRODUCTIVITY  REPORT 

When  we  addressed  the  issue  of  productivity  we  quickly  recognized  that  the 
Central  Appointment  Section  (CAS)  must  be  a  vital  link  in  the  chain.  The  CAS 
contribution  to  the  productivity  puzzle  has  been  what  we  call  the  Outpatient 
Productivity  Report  (Figures  9  and  10).  Utilizing  this  report  the  department 
and  clinic  chiefs  have  a  good  mechanism  for  evaluating  productivity  as  it  re¬ 
lates  to  outpatient  care.  The  Outpatient  Productivity  Report  is  produced  each 
week,  but  one  major  problem  we  hope  to  resolve  is  that  with  limited  resources, 
the  report  is  not  produced  in  a  timely  manner.  The  14-18  May  report  was  pro¬ 
duced  on  4  June.  Hopefully,  with  automation  and  refinement  of  the  reporting 
system  we  can  produce  a  more  timely  report. 

The  report  lists  each  provider,  MC ,  PA,  nurse  clinician,  MSC,  by  name  and 
provides  hours  of  authorized  absences  for  each  health  care  provider,  CAS 
appointed  hours,  clinic  scheduled  hours  (primarily  for  ward  rounds,  meetings, 
etc.,  walk-in  patients,  etc.).  It  reflects  the  number  of  walk-in  patients,  the 
number  of  clinic  appointed  patients  and  CAS  appointed  patients.  The  total 
patients  are  then  divided  by  the  number  of  days  that  the  providers  are  in  the 
clinic. 

This  report  provides  the  manager,  department  and  clinic  chief,  with  the 
actual  workload  or  productivity  of  each  provider  each  week.  If  the  manager 
evaluates  the  data  over  an  extended  period  he  may  be  able  to  evaluate  negative 
trends  and  institute  corrective  action  where  appropriate. 
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We  have  pursued  the  possibility  of  combining  the  Outpatient  Productivity 
Report  and  the  Physician  Productivity  Report,  but  because  of  varying  reporting 
periods  and  the  difficulty  of  producing  the  Outpatient  Productivity  Report  in  a 
timely  manner  we  could  not  combine  them.  However,  if  we  could  we  could  possibly 
produce  a  report  which  ties  individual  productivity  to  departmental  staffing 
criteria  or  self  imposed  goals,  we  could  produce  greater  peer  pressure,  and  a 
methodology  for  comparing  the  productivity  of  the  physician  who  sees  patients 
with  complex  problems  versus  a  physician  who  sees  acute  respiratory  disease,  and 
relating  all  this  to  quality. 

MILITARY  AVAILABILITY  REPORT 

The  impact  of  nonavailable  time  and  military  duties  on  assigned  hours  has 
long  been  a  concern  of  the  MEDDAC  in  relation  to  mission  accomplishment  and 
maximizing  access  of  patients  to  quality  care.  The  emphasis  on  readiness  and 
supporting  the  global  mission  has  required  involvement  in  field  training  exer¬ 
cises,  CTT,  PT  programs,  and  other  related  duties,  has  taken  personnel  away  from 
their  duties.  In  the  near  future  the  MEDDAC  will  undergo  a  manpower  survey.  In 
an  effort  to  determine  the  impact  of  the  nonavailable  time  and  military  duties 
we  have  analyzed  the  utilization  of  the  military  personnel  on  three  wards,  four 
support  branches,  and  six  clinics.  The  first  two  quarters  of  fiscal  year  84 
were  evaluated.  The  data  for  the  analysis  was  collected  from  the  Uniform 
Staffing  Methodology  ( USM )  report.  We  have  collected  this  military  unique  time 
since  March  1982.  The  information  reflected  in  Figure  11  lists  the  workcenter, 
the  available  hours,  the  hours  assigned,  the  hours  nonavailable,  and  hours  of 
military  duty  for  the  first  and  second  quarters.  In  all  areas  the  hours  nona¬ 
vailable  exceeded  the  11*  nonavailability  factor  prescribed  in  the  current 
staffing  guide  criteria  and  in  many  instances  the  percentage  of  time  devoted  to 
military  duties  exceeded  the  11*  factor.  In  summary,  the  percentage  of  military 


320 


to  ex 
az  * sr  >- 
O  H-  H- 
O  *-«  =5 
x  -i  e 


VO  CO 
CVJ  CVJ 


*—  <NJ 
VO  vo 
GO  <t 


00  o 

GO  O 
vO  CVJ 
fvj  •— 


>- 

H* 

*— « 

^r 

00 

>- 

«c 

Ll_ 

—1 

to 

a 

UJ 

♦— 

oz 

»— 1 

oc 

X) 

T-~i 

UJ 

O' 

QJ 

o 

o 

k. 

ex 

X 

O 

X 

CVJ 

cr. 

•r* 

$ 

o 

Li_ 

z 

>~ 

ex 

«c 

«r 

>— 

*- 

CO 

-j 
*— < 

X 

to  -j 
ocr  « 
D  O  < 
O  X  >• 
X  <. 


o  o 
o  *r 

r-*  vO 


to  o 

GO  •— 
co  *— 


CVJ  CO 

co  cvj 


CVJ 

CVJ 

tO 

vO 

o 

r*- 

vO 

r*- 

co 

CO 

lO 

cx» 

vo  ro 
O 

co 


vO 

CO 

CVJ 

<r» 

to 

CVJ 

to 

cO 

CVJ 

rr 

CO 

CVJ 

O'- 

X 

CO 

CO 

to 

to 

r— 

00 

ex 

K— 

oc 

►— 

(X 

♦— 

ex 

s? 

ex 

X- 

ex 

*— 

JO 

E 

Cr 

O 

VJ 

s 

O' 

cr 

O 

cr 

o 

«r 

h* 

o 

E 

h- 

O 

o 

i— 

o 

CO 

z 

L. 

to 

Z 

x: 

to 

z 

o 

co 

o 

CVJ 

#0 

XT 

ex 

CVJ 

4-> 

*3 

ex 

CVJ 

Nutrition  Care  Div 

1ST  QTR  4093  4452  835  544 

2ND  QTR  3997  4296  535  413 


MILITARY  MANPOWER  AVAILABILITY 


321 


1ST  QTR  20.4  13.3  18.8  12.2 

2ND  QTR  13.4  10.3  12.5  9.6 
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duties  is  requiring  more  and  more  time  and  reducing  the  availability  of  these 
personnel  to  support  the  health  care  mission. 

SUMMARY 

In  summation  what  we  want  to  develop  is  a  methodology  to  analyze  and  eval¬ 
uate  the  potential  for  each  provider  to  provide  quality  care  at  maximum  produc 
tivity.  We  need  to  know  how  productive  each  provider  is  in  doing  what 
(diagnosis),  and  then  be  able  to  audit  some  of  what  he  is  doing  in  terms  of 
quality. 


A  DATA  BASED  QUALITY  ASSURANCE  PROGRAM 


MAJ  Donald  E.  O'Brien,  MSC,  USA 
CPT(P)  James  M.  King,  MSC,  USA 
A.D.  Mangelsdorff ,  Ph.D. 

Accurate,  timely,  and  appropriate  data  are  the  cornerstones  of  any  Quality 
Assurance  Program  (QAP).  Modern  automatic  data  processing  systems  have  made  It 
possible  for  program  managers  to  obtain  data  necessary  for  the  enlightened  man¬ 
agement  of  hospital  programs.  This  paper  will  present  a  model  of  a  multihospi¬ 
tal  QAP  for  a  Major  Medical  Command  (MEOCOM.) 

The  plan  suggesed  here  departs  from  the  traditional  concept  of  the  role  of 
the  MEOCOM  In  that  It  recommends  that  the  MEOCOM  conduct  a  QAP  by  actively  moni¬ 
toring  the  care  being  given  in  Its  facilities,  and  that  It  abandon  the  outdated 
concept  that  MEOCOMs  should  only  monitor  the  Quality  Assurance  Plans  of  subor¬ 
dinate  activities.  Only  by  realizing  the  implications  of  this  distinction  and 
following  through  with  the  logical  staffing,  tasking,  and  organization  of  a 
Quality  Assurance  Division  will  modern  and  operationally  sound  QAPs  come  into 
existence  In  our  MEOCOMs. 

MEOCOM  QUALITY  ASSURANCE  DIVISION 

In  our  proposed  schema  each  MEOCOM  would  have  QA  Division  (QAD),  charged 
with  monitoring  the  level  of  medical  care  provided  by  the  command's  subordinate 
medical  treatment  facilities.  This  QAD  would  be  responsible  for  regularly 
reporting  Its  findings  to  the  MEDCOM  Commander. 

The  QAO  should  ideally  be  divided  into  two  separate  sections.  The  first 
section  should  be  a  Patient  Care  Evaluation  Branch  (PCEB),  and  the  second  sec¬ 
tion,  a  Quality  Assurance  Support  Branch  (QASB).  The  QASB  would  concern  itself 
with  the  many  administrative  duties  required  of  a  QAD.  It  is  with  the  functions 
and  operations  of  the  PCEB  that  the  remainder  of  the  paper  will  concern  itself. 


As  a  minimum  the  PCEB  must  have  a  Biometric  Data  Analyst  (BDA)  and  a 
Patient  Care  Evaluation  Physician  (PCEP)  assigned,  having  the  following  quali¬ 
fications  and  duties:  The  Biometric  Data  Analyst  should  have  experience  as  a 
hospital  PAD  officer  or  as  a  clinical  practitioner  and  administrator  in  one  of 
the  ancillary  care  specialties.  He  must  be  able  to  request,  analyze,  and 
interpret  data  appropriate  to  the  mission  of  the  PECB,  using  appropriate  data 
analysis  procedures.  The  Patient  Care  Evaluation  Physician  should  have  a  well 
rounded  background  in  clinical  medicine,  and  be  familiar  with  automated  data 
management  techniques. 

Both  of  these  officers  would  be  expected  to  make  recommendations  to  the 
MEDCOM  Commander  and  the  Commander  of  the  Biometric  Data  Facility  regarding 
the  appropriateness  of  the  data  being  collected  by  the  patient  data  systems, 
and  the  technical  aspects  of  the  data  collection,  editing,  input,  and  re¬ 
trieval  systems  as  they  impact  on  the  ability  of  the  PCEB  to  effectively 
monitor  the  MEDCOM  QAP . 

THE  QUALITY  ASSURANCE  DATA  ANALYSIS  SYSTEM 

The  QA  Data  Analysis  System  is  characterized  in  Figures  1  and  2.  Figure  1 
shows  the  MEDCOM/MTF  Command  Structure  and  indicates  that  the  MTFs  submit  data 
on  patients  treated  at  their  facilities  to  a  data  holding  facility  at  regular 
intervals  in  accordance  with  existing  regulations.  The  model,  as  presented 
here,  is  equally  applicable  to  the  Biostatistics  Activity  of  any  service  and  to 
any  of  the  Major  Medical  Commands.  Although  the  activities  now  deal  solely  with 
inpatient  data,  the  schema  outlined  here  will  also  apply  to  outpatient  data  when 
these  are  introduced  into  the  system  at  a  later  date. 

Figure  ?  indicates  that  the  Biometric  Data  Facility  (BDF)  compiles  and  edits 
the  data  reported  by  the  MTFs  and  issues  reports  based  on  that  data  to  the  MTFs 
and  to  the  PCEB.  Both  the  MTFs  and  the  PCEB  have  the  ability  to  request  special 
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reports  from  the  BDF  to  obtain  data  on  patient  care  that  relates  to  their  par¬ 
ticular  mission.  Examples  of  some  standard  reports  that  could  be  generated  by 
the  Biometrics  Data  Facility  are  shown  in  Table  1. 
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Table  1 

EXAMPLES  OF  BDF  REPORTS 


1.  Reports  to  the  individual  MTF: 

a.  Inpatient: 

(1)  Number  of  patients  by  service  by  DRG. 

(2)  Mean  LOS  by  DRG  plus  criterion  LOS. 

(3)  Number  of  cases  with  signal  events. 

(4)  Number  of  cases  with  multiple  major  surgical 
procedures  by  DRG  by  provider. 

(5)  Number  of  patients  readmitted  within  a  speci¬ 
fied  time  period  after  discharge  by  DRG  by 
provider. 

b.  Outpatient: 

(1)  Number  of  patients  seen  by  clinic/service. 

(2)  Number  of  patients  returning  for  the  same 
complaint  within  a  specified  time  period 
and  subsequently  admitted. 

2.  Reports  to  PCEB: 

a.  Inpatient: 

(1)  Number  of  patients  by  MTF  by  DRG. 

(2)  Mean  LOS  by  DRG  by  MTF  plus  criterion  LOS. 

(3)  Number  of  cases  with  signal  events  by  MTF. 

(4)  Number  of  cases  with  multiple  major  surgical 
procedures  by  DRG  by  MTF. 

(5)  Number  of  patients  readmitted  within  a  speci¬ 
fied  time  period  after  discharge  by  DRG  by 
MTF. 

b.  Outpatient: 

(1)  Number  of  patients  seen  by  MTF. 

(2)  Number  of  patients  returning  for  the  same 
complaint  within  a  specified  time  period 
and  subsequently  admitted  by  MTF. 
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For  each  quality  of  care  indicator  reported,  there  will  also  be  a  correspon¬ 
ding  accepted  standard  of  care  to  which  that  indicator  can  be  compared.  MTFs 
will  be  grouped  in  categories  that  allow  realistic  comparison  of  their  perfor¬ 
mance.  For  example,  data  on  MEDCENS,  and  large,  medium,  and  small  hospials  will 
be  presented  separately,  and  norms  could  be  based  on  civilian  or  military  data. 

Because  many  variables  are  available  for  analysis,  the  projected  standard 
monthly  reports  would  cover  only  a  small  portion  of  the  data  available  through 
the  BDF.  Therefore,  the  PCEB  would  regularly  request  special  reports  that 
investigated  in-depth  one  or  more  data  items  available  in  the  Individual  Patient 
Data  System.  For  example,  the  PCEB  might  decide  to  investigate  care  given  to 
patients  in  DRG  #000  for  the  month  of  May  1984.  The  PCEB  would  then  request 
descriptive  statistics  for  DRG  #000  for  the  month  of  May  1984,  by  MTF,  from  the 
BDF.  The  BDF  would  make  a  data  run  and  forward  the  report  to  PCEB.  In  this 
report  MTFs  would  be  identified  by  code  number  only.  This  report  would  include 
all  MTFs  that  had  patients  categorized  in  DRG  #000  during  May  1984.  The  PCEB 
would  analyze  the  data  to  see  if  either  the  system  as  a  whole,  or  any  individual 
MTFs,  showed  indications  of  significant  deviations  from  established  norms.  (The 
norms  might  be  system  wide,  national,  regional,  or  based  on  comparable  MTFs.) 

If  no  deviations  were  found,  the  analysis  of  this  DRG  could  stop  at  this  point. 
However,  if  any  MTF  is  found  to  have  a  mean  LOS  greater  than  some  predetermined 
criteria,  the  PCEB  would  continue  the  investigation. 

The  next  step  of  the  PCEB  would  be  to  look  at  the  DRG  by  clinical  service  in 
the  hospital (s)  with  the  above  average  LOS.  The  PCEB  would  request  a  report 
from  the  BDF  showing  the  PRO  for  the  MTF  in  question  by  clinical  services,  LOS, 
patient  category  (e.g.,  AD,  Ret),  by  age  group,  and  by  sex.  At  this  point  it 
is  quite  conceivable  that  the  data  would  present  a  clear  picture  of  the 
majority  of  patients  falling  well  within  the  accepted  LOS  for  the  DRG,  with 


only  one,  or  a  very  few,  outliers  whose  longer  LOS  can  be  easily  explained  by 
the  other  categories  or  information. 

If  the  PCEB  was  satisfied  that  it  had  adequately  explained  the  variant  data 
for  the  DRG  in  question,  the  analysis  would  terminate  at  this  point.  If  there 
was  still  a  question  about  a  particular  case,  or  group  of  cases,  the  PCEB 
could  request  a  printout  of  the  individual  case  by  discharge  abstract  from  the 
BDF.  The  PCEB  would  study  this  abstract  to  see  if  the  information  contained 
therein  answered  their  questions.  If  so,  the  analysis  would  terminate.  If  the 
PCEB  could  find  no  adequate  explanation  of  the  extended  LOS  for  the  DRG,  it 
would  have  the  option  of  contacting  the  MTF  commander  for  further  information  on 
the  case  or  cases  in  question. 

In  future  years,  software  could  be  developed  by  the  PCEB  and  the  BDF  which 
focused  on  selected  diagnoses  or  DRGs.  Critical  values  for  LOS  would  be 
established  and  the  data  generated  automatical ly  when  aberrations  occurred. 

This  type  of  development  is  more  sophisticated,  but  would  relieve  the  PCEB  of 
much  burdensome  data  analysis  in  its  day  to  day  operations.  This  step  would 
also  facilitate  the  ability  of  the  BDF  to  provide  critical  and  timely  infor¬ 
mation  to  the  MTFs  and  to  their  patient  care  mission. 

A  similar  situation  would  exist  in  regard  to  the  reports  sent  to  the  com¬ 
manders  of  the  MTFs.  They  would  receive  only  a  small  portion  of  the  data 
available  in  the  BDF  data  base.  However,  the  MTFs  then  could  also  request  spe¬ 
cial  reports  about  their  own  facilities,  which  would  allow  them  to  investigate 
suspected  problem  areas  in  detail. 

The  above  process  would  provide  the  MEDCOM  Commander  with  a  modern  quan¬ 
titative  tool  for  both  determining  the  quality  of  care  provided  in  the  command, 
and  for  making  decisions  as  to  the  allocation  of  resources  in  order  to  achieve 
the  desired  level  of  patient  care.  It  would  also  provide  the  MTF  commanders 
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with  new  tools  to  appraise  the  relative  effectiveness  of  their  facility  in  pro¬ 
viding  quality  care  to  the  patients  they  serve. 

EFFECTS  OF  THE  SYSTEM 

The  process  described  above  presents  a  'ystem  to  macroscopical ly  monitor 
the  quality  of  care  being  delivered  in  a  MEDCOM.  Although  conceptually  simple, 
it  relies  on  three  essential  factors  to  be  successful:  (1)  command  support;  (2) 
trained  personnel;  and,  (3)  a  sophisticated  and  flexible  data  collection,  analy¬ 
sis,  and  distribution  system. 

The  true  value  of  such  a  system  will  be  realized  if  it  can  be  developed 
to  the  point  where  patterns  of  suboptimal  care  can  be  detected  prior  to  the 
occurrence  of  crises  in  patient  care,  and  if  the  detection  of  such  patterns  can 
activate  the  resources  of  the  health  care  system  to  correct  the  deficiencies. 

In  this  system,  the  importance  of  Biometrics  Activity  to  the  command  will 
increase  immeasureably,  for  it  will  be  constantly  called  upon  to  act  as  a  vital 
link  between  the  MTFs  and  the  MEDCOM  Commander. 

This  monitoring  system  is  compatible  with  JCAH  requirements.  It  assumes  a 
functioning,  locally  controlled  QAP,  and  it  assumes  the  continuation  of  the 
substructure  of  committees  and  peer  review  in  each  MTF.  For,  only  if  local 
efforts  are  successful,  can  quality  care  emerge  throughout  the  command. 

A  feature  key  to  the  successful  use  of  this  system  would  be  the  positive 
recognition  of  successful  local  QAPs.  This  must  include  the  commitment  of 
the  MEDCOM  Commander  to  intervene  at  an  early  stage  to  improve  areas  of 
patient  care  before  serious  problems  occur.  However,  if  this  system  is  used 
only  to  discover  "mistakes"  and  place  blame,  then  it  will  become  a  hinderance 
rather  than  a  help  to  delivery  of  quality  patient  care. 

If  used  in  a  positive  manner,  this  system  has  the  potential  for  recogni¬ 
zing  MTF  commanders  whose  facilities  deliver  exceptional  patient  care.  This 
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would  add  another  dimension  to  the  evaluation  of  a  commander's  ability  to  manage 
a  medical  facility,  and  would  improve  the  procedures  for  selecting  such  individ¬ 
uals  for  higher  level  command  and  administration  positions. 

This  system  will  not  prevent  cases  of  provider  malpratice,  nor  lapses  in 
care  that  occur  despite  the  best  efforts  of  direct  care  providers.  What  the 
system  can  do  is  prevent  institutionalized  subpar  performance.  Once  a  pat¬ 
tern  of  such  performance  is  recognized,  corrective  action  to  alleviate  the 
problem  is  mandated  at  the  level  of  recognition  (e.g.,  MEDCOM  level),  where  the 
marshalling  of  resources  is  possible. 

By  having  direct  access  to  information  on  the  quality  of  patient  care 
being  delivered  in  the  command,  the  MEDCOM  Commander  will  be  able  to  manage  his 
resources  in  the  most  appropriate  manner.  He  will  be  able  to  manage  by  using 
recent  patient  care  data,  rather  than  by  relying  on  last  year's  outmoded  physi¬ 
cal  plant  evaluation,  or  outdated  budget  and  personnel  estimates. 

Finally,  the  implementation  of  the  system  suggested  here  would  create 
a  mechanism  by  which  the  MEDCOM  could  readily  investigate  a  number  of  questions 
pertinent  to  the  broader  issues  of  QA  in  the  MEDCOM  system.  For  example  this 
system  would  lend  itself  to:  (1)  The  development  of  software  to  automate  the 
credential ing  process.  The  credentials  of  all  providers  could  be  contained  in  a 
central  data  base.  Each  record  would  indicate  the  procedures  that  the  provider 
is  credentialed  to  perform,  whether  the  procedure  could  be  performed  indepen¬ 
dently  or  with  supervision,  whether  the  procedure  is  time  relational  ( i . e . , 
needs  current  training  or  periodic  retraining),  and  the  date  and  location  of  the 
last  credenti al ing  committee  action.  (2)  The  development  of  software  to  include 
the  Risk  Management  Program  in  the  QAP.  Also,  biometric  data  could  be  compared 
with  litigation  claim  data,  by  malpractice  data,  by  MTF,  by  clinic  service  and 
by  provider.  (3)  The  development  of  an  automated  index  that  rates  the  effect  of 
Facility  obsolescence  on  the  quality  of  patient  care.  (4)  The  development  of  an 
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automated  system  for  identifying  equipment  in  each  MTF  which  is  essential  to 
quality  patient  care.  The  list  should  identify  manufacture,  model  number,  age, 
purchase  cost,  yearly  repair  cost,  and  percentage  of  "down"  time.  This  infor¬ 
mation  would  be  available  system  wide  in  a  convenient  and  readily  accessible 
form  and  could  be  used  by  providers  contemplating  the  purchase  of  new  medical 
equipment.  (5)  The  development  of  an  automated  system  to  report  the  occurrence 
of  critical  staff-patient  ratios  in  each  facility  work  center.  (6)  The  QA  data 
generated  by  the  Data  Based  QAP  could  be  used  as  part  of  a  multivariate  analysis 
of  the  Health  Care  Delivery  System.  For  example,  outcome  factors  could  be  com¬ 
pared  against  staff-patient  ratios,  staff  training  levels,  facility  ratings,  IG 
and  JCAH  inspection  ratings,  and  cost  per  patient,  in  order  to  develop  data 
based  management  indicators. 

The  model  suggested  here  would  bring  about  a  true  multihospital  QAP  for 
the  Uniformed  Services.  This  model  may  seem  radical  to  some.  However,  it  is 
not  radical  to  the  civilian  medical  community.  Already,  articles  have 
appeared  in  print  espousing  the  same  ideas,  and  the  Sisters  of  Mercy  Hospital 
Corporation  are  already  embarked  on  a  three  year  study  of  multihospital 
quality  assurance.  Moreover,  the  Department  of  Defense  is  pressuring  the 
services  to  improve  their  QA  programs.  Thus,  it  will  only  be  a  matter  of  time 
before  the  services  are  forced  to  begin  multihospital  QAPs. 

At  present,  we  are  in  a  position  tc  be  ahead  of  the  rest  of  the  nation  and 
to  institute  a  multihospital  QAP  to  meet  the  needs  of  the  uniformed  services. 

The  trained  personnel  are  available,  the  automated  systems  are  available,  and 
the  blueprint  is  available.  All  that  is  necessary  is  for  the  pieces  to  be 
brought  together  to  begin  a  truly  effective  and  modern  multihospital  QAP. 
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TR I -SERVICE  PERFORMANCE  MEASUREMENT  CONFERENCE 


Information  for  Group  Sessions 


Guidelines  for  All  Groups 

1.  Identify  the  major  issues  involved  in  each  topic  area. 

2.  Determine  what  specific  questions  must  be  answered. 

3.  Determine  the  status  of  Service  projects/activities/interest  in  this  area. 

4.  Suggest  what  types  of  studies  must  be  done. 

5.  Reco-i^end  how  labor  should  be  divided  between/within  the  Services  and  DOD. 

6.  Suggest  how  we  can  foster  Tri-Service  cooperation  in  this  area. 

7.  Relate  the  group  activity  to  what  we've  heard  at  the  conference  thus  far. 

8.  Answer  the  specific  questions  provided  below  to  the  extent  possible. 

9.  Prepare  to  report  the  results  of  group  activity  during  the  final  conference 
session  on  Friday  morning. 

Group  Topics 

Al.  PATIENT  CLASSIFICATION  SCHEMES  FOR  MILITARY  DATA  (Wednesday  Night) 

1.  Are  DRGs  the  way  to  go  based  on  what  we've  seen? 

2.  What  military  unique  variables  must  be  included? 

3.  What  additional  splits  might  be  made? 

4.  How  might  we  "regroup"  to  reduce  the  number  of  Military  Diagnosis  Related 
Groups  (MDRGs)? 

5.  Do  we  start  over  or  can  we  modify  existing  DRGs? 

6.  What  is  the  impact  of  ICD9  versus  ICD9-CM  coding  issues  on  DRGs? 

7.  What  is  the  current  status  of  coding  "maps"? 

8.  Should  we  recommend  a  switch  to  ICD9-CM? 

A2.  RELATIVE  WEIGHT  DETERMINATION  FOR  MDRGs  (Thursday  Night) 

1.  How  can  we  incorporate  severity  of  illness,  patient  acuity,  nursing 
intensity,  or  disease  staging  into  classification  and  weighting? 

2.  How  can  MDRGs  be  tied-in  with  UCA  for  work  load  accounting? 
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A2.  RELATIVE  WEIGHT  DETERMINATION  FOR  MDRG  (Continued) 

3.  What  should  be  the  basis  for  relative  weight  assignment? 

4.  Should  there  be  a  single  set  of  weights  for  DOD? 

5.  How  should  we  handle  "mission  clusters"  in  weight  determination? 

(Mission  Clusters  are  the  "product  lines"  our  facilities  have  to 
produce  by  mission  which  may  involve  types  of  care  with  very  low 
civilian  relative  resource  use  weights  but  which  are  very  important 
to  Congress. ) 

B.  CONCEPTUAL  APPROACHES  TO  PRODUCTIVITY  MEASUREMENT  (Wednesday  and  Thursday) 

1.  Does  efficacy/efficiency  make  sense  as  a  basis  for  monitoring  two 
distinctly  different  types  of  performance? 

2.  Should  we  replace  the  Composite  Work  Unit  (CWU  OR  MCCU)? 

3.  How  do  we  include  the  quality  component  in  performance  measurement? 

4.  What  types  of  measures  do  we  need  for  internal  Service  resource 
allocation? 

5.  What  types  of  measures  do  we  need  for  external  budget  defense  before 
Congress? 

6.  Who  should  set  individual  provider  and  clinlc/facillty/system 
performance  standards  for  quantity  and  quality  of  work  load? 

7.  What  information  on  individual  and  subsystem  performance  is  required  at 
the  facility,  command,  and  higher  levels?  (i.e.,  what  measures  and  types 
of  reports  are  required  for  decision  makers  at  various  levels?) 

C.  PERFORMANCE  MEASUREMENT  IN  NON-PATIENT  CARE  ACTIVITIES  (Wednesday  and 

Thursday) 

1.  How  do  we  currently  measure  overhead/readiness/mobilizatlon/training 
work  load  in  medical  treatment  facilities? 

2.  What  activities  should  we  measure  under  the  heading  of  "non-patient 
care?" 

3.  How  can  such  work  load  measurement  fit  In  with  the  UCA? 

4.  How  can  the  relative  weighting  of  Indirect  versus  direct  patient  care 
be  determined? 

5.  Should  any  proposed  system  track  individual  or  "department"  performance 
in  this  area? 

6.  Are  currently  utilized  "nonavailability  factors"  adequate? 

7.  How  can  such  work  load  be  Incorporated  Into  the  manpower  requirement  or 
authorization  process? 
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D.  AMBULATORY  CARE  DATA  CAPTURE  (Wednesday  and  Thursday) 

1.  What  specific  data  elements  should  be  included  in  an  Ambulatory  Care 
Data  Base  (ACD8)? 

2.  What  level  of  detail  in  coding  diagnoses  and  procedures  and  capturing 
provider  and  resource  use  information  is  required? 

3.  Is  the  basic  unit  of  observation  for  an  ACDB  the  visit  or  the  episode? 

4.  What  is  the  best  method  for  data  capture? 

5.  What  is  the  potential  impact  of  proposed  data  capture  methods  on  the 
local  facility? 

6.  How  can  an  ACOB  tie-in  to  UCA  and  TR IMIS  systems? 

7.  What  should  be  the  basis  for  developing  a  set  of  Ambulatory  Patient 
Clusters  (APC)? 

8.  Must  the  same  coding  system  be  used  for  both  inpatient  and  ambulatory 
patient  data  collection? 

9.  How  could  an  ACDB  become  the  basis  for  an  Automated  Quality  Assurance 
System? 


Suggested  Group  Assignments  (*Co-Chairpersons ) 


Al.  PATIENT  CLASSIFICATION  SCHEMES  FOR  MILITARY  DATA 


A2.  RELATIVE  WEIGHT  DETERMINATION  FOR  MDRGs 


LTC  Arnt 
Ms.  Austin 
*MAJ  Baker 
CPT  Fye 
Ms.  Glover 
MAJ  Heckert 


Mr.  Kay 
Ms.  Lindsay 
Mr.  Pasternack 
*CDR  Rieder 
Ms.  Scott 
LTC  Vorpohl 


B.  CONCEPTUAL  APPROACHES 

MAJ  Crutchfield 
*CQL  Fields 
LTC  Miller 
LTC  Moore 
.APT  Morin 
*CPT  Optenberg 


TO  PRODUCTIVITY  MEASUREMENT 

CPT  Patterson 
Mr.  Rushdi 
LTC  Smith 
LCDR  Tompkins 
LTC  Vago 


C.  PERFORMANCE  MEASUREMENT  IN  NON-PATIENT  CARE  ACTIVITIES 


MAJ  Abshire 
Ms.  Bateman 
CPT  Bigelow 
MAJ  Borders 
Mr.  Oavenport 


*LCDR  Ford 
*CPT  King 
MAJ  O'Brien 
Dr.  Mangelsdorff 


D.  AMBULATORY  CARE  DATA  CAPTURE 


LCDR  Boyer 
MAJ  Clement 
MAJ  Evans 
MAJ  Leahy 
*LTC  Misener 


CPT  Morreale 
Mr.  Pasternack 
*CAPT  Pleet 
COL  Rosenberg 
COL  Young 


*Co-Chairpersons  are  responsible  for  assisting  the  Organizational  Effectiveness 
(O.E.)  facilitators  in  directing  the  attention  of  the  group  to  the  suggested 
activities  and  not  necessarily  for  providing  expert  input  to  the  subject  area. 
They  are  also  responsible  for  assuring  the  group  is  ready  to  "report  out"  on 
Friday  morning. 
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TR I -SERVICE  PERFORMANCE  CONFERENCE 

Results  of  Group  Sessions 

GROUP  Al:  PATIENT  CLASSIFICATION  SCHEMES  FOR  MILITARY  DATA 

1.  Are  DRGs  the  way  to  go  based  on  what  we've  seen? 

a.  Yes,  but  they  must  be  tailored  to  fit  the  military  situation. 

b.  We  will  need  additional  instruments  or  measures  as  we  go. 

c.  DRGs  do  not  adequately  account  for  severity  of  illness. 

d.  The  time  is  very  ripe  to  proceed  with  DRG  analysis. 

e.  We  must  develop  standards  and  times  for  nursing  care. 

2.  What  military  unique  variables  must  be  included? 

a.  Disability  separation. 

b.  Convalescent  leave. 

c.  Transfers  in  or  out. 

d.  Age  of  patient  or  length  of  service. 

e.  Patient  categories  to  include  enlisted  versus  officer  status. 

3.  What  additional  splits  might  be  made? 

a.  We  don't  know  now  but  some  will  be  necessary. 

b.  We  will  know  more  as  analysis  concerning  question  2  progresses. 

4.  How  might  we  "regroup"  to  reduce  the  numbe.  of  Military  Diagnosis 
Related  Groups  (MDRGs)? 

a.  Group  does  not  wish  to  answer  at  this  time. 

b.  Left  to  each  service's  discretion. 

5.  Do  we  start  over  or  can  we  lodify  existing  DRGs? 

(see  previous  answers) 

6.  What  is  the  impact  of  I  CD-9  versus  ICD-9-CM  coding  issues  on  DRGs? 
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a.  Preliminary  results  seem  to  indicate  great  significance. 

b.  To  be  determined  by  ongoing  studies. 
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7.  What  is  the  current  status  of  coding  "maps"? 

a.  Several  versions  under  various  states  of  evaluation/revision. 

b.  Will  eventually  need  to  have  a  single  map  for  all  DOD  to  use. 

8.  Should  we  recommend  a  switch  to  ICD-9-CM? 

a.  Not  should  we  but  when  can  we? 

b.  Need  to  conduct  a  study  (decision  paper)  of  advantages,  disad¬ 
vantages,  quality  of  coding,  life  cycle  costs,  etc. 

c.  Link  efforts  to  Wilford  Hall  study. 

d.  There  are  more  than  performance  measurement  reasons  to  recommend 
considering  ICD-9-CM. 

GROUP  A2:  RELATIVE  WEIGHT  DETERMINATION  FOR  MDRGs 

1.  How  can  we  incorporate  severity  of  illness,  patient  acuity,  nursing 
intensity,  or  disease  staging  into  classification  and  weighting? 

a.  Only  after  very  careful  study. 

b.  Severity  of  Illness:  appears  meaningful;  needs  to  be  tested; 
would  have  data  availability  problems  for  the  services. 

c.  Patient  Acuity/Nursing  Intensity:  As  current  implementations 
continue  to  take  place  validate  and  incorporate  into  DRG 
analyses . 

d.  Disease  Staging:  Army  will  pursue  with  SysteMetrics  software. 

2.  How  can  MDRGs  be  tied  in  with  UCA  for  workload  accounting? 

a.  Investigate  TRIPAD  (Army,  Navy). 

b.  Army  and  Navy  review  Wilford  Hall  process. 

c.  Air  Force  replicate  process  at  other  facilities. 

3.  What  should  be  the  basis  for  relative  weight  assignment? 

a.  As  an  interim  measure  all  services  should  use  the  HCFA  weights 
with  updates  as  they  become  available. 

b.  Each  service  can  explore  modifications  independently. 

c.  Eventually  tie  into  UCA  cost  analysis. 

4.  Should  there  be  a  single  set  of  weights  for  DOD? 

Yes. 


5. 


How  should  we  handle  "Mission  Clusters"  in  weight  determination? 
(Mission  Clusters  are  the  "Product  Lines"  our  facilities  have  to 
produce  by  mission  which  may  involve  types  of  care  with  very  low 
civilian  relative  weights  but  which  are  very  important  to  Congress 
as  part  of  the  readiness  or  benefit  missions.) 

The  group  did  not  think  it  wise  to  consider  this  concept  at  this 
time.  We  should  stick  with  the  same  weights  as  HCFA  for  now  and 
explain  our  differences  in  some  other  manner  besides  weights.  We 
need  external  comparability  at  this  time. 

GROUP  B:  CONCEPTUAL  APPROACHES  TO  PRODUCTIVITY  MEASUREMENT 

1.  Does  efficacy/efficiency  make  sense  as  a  basis  for  monitoring  two 
distinctly  different  types  of  performance? 

Yes,  but  not  sure  how  to  implement  at  this  time. 

2.  Should  we  replace  the  composite  work  unit  (CWU  or  MCCU)? 

Unanimous  agreement  that  status  quo  is  inadequate. 

3.  How  do  we  include  the  quality  component  in  performance  measurement? 
Not  sufficient  time  to  address  during  this  conference. 

4.  What  types  of  measures  do  we  need  for  internal  service  resource 
al location? 

a.  DRGs  will  be  the  main  basis  for  measurement,  to  be  augmented  by 
other  measures  to  be  developed. 

b.  Ultimate  goal:  Resource  allocation  based  on  DRGs  interfaced 
with  UCA/USM,  with  provisions  for  other  measures  as  they  emerge. 

5.  What  types  of  measures  do  we  need  for  external  budget  defense  before 
Congress? 

Same  measures  will  be  used  for  internal  and  external  budget  defense 
to  the  extent  possible. 

6.  Who  should  set  individual  provider  and  clinic/facility/system  per¬ 
formance  standards  for  quantity  and  quality  of  workload? 

Standards  will  result  from  normative  data  produced  under  the  DRG 
system.  All  Levels  of  command  can  utilize  these  norms  in  setting 
standards  to  augment  existing  quality  assurance  programs. 

7.  What  information  on  individual  and  subsystem  performance  is  required 
at  the  facility,  command,  and  higher  levels? 


Not  addressed. 


8.  Additional  question  discussed.  How  can  we  continue  the  contacts 
made  and  discussions  begun  at  this  conference? 

a.  Information  briefings  should  be  presented  to  each  service  Sur¬ 
geon  General . 

b.  Some  type  of  formal  information  sharing  structure  will  be 
created. 

c.  Quarterly  "mi ni -conferences"  should  be  held  for  key  workers  from 
each  service  on  selected  topics  to  review  progress  and  plans. 

d.  Formal  division  of  labor  at  this  early  stage  is  not  recommended. 
GROUP  C:  PERFORMANCE  MEASUREMENT  IN  NON-PATIENT  CARE  ACTIVITIES 

1.  How  do  we  currently  measure  overhead/readiness/mobilization/training 
workload  in  medical  treatment  facilities? 

a.  Uniform  Staffing  Methodology/Uniform  Chart  of  Accounts. 

b.  Cost  Accounts  Codes/Army  Management  Structure. 

c.  Uniform  Chart  of  Accounts  Personnel  System  (UCAPERS). 

2.  What  activities  should  we  measure  under  the  heading  of  "Non-patient 
Care"? 

a.  All  administrative  overhead. 

b.  E  and  F  accounts  within  UCA  as  applicable. 

c.  Continuing  Medical  Education. 

d.  Committees  and  reporting. 

e.  Clinical  Investigations  and  research. 

f.  Local  travel. 

g.  Command  and  control. 

h.  Liaison  duties. 

3.  How  can  such  workload  measurements  fit  in  with  UCA? 

a.  Make  the  UCA  accounts  compatible  with  USM. 

b.  Flesh  out  both  the  UCA  and  USM  in  non-patient  care  activities. 
Must  have  fully  automated  UCA/USM  and  local  command  emphasis. 

4.  How  can  the  relative  weighting  of  indirect  versus  direct  patient 
care  be  determined? 

The  weighting  is  implicit  in  the  expanded  reporting  process. 


Should  any  proposed  system  track  individual  or  "Department"  perfor¬ 
mance  in  this  area? 


Individual  will  allow  us  to  capture  at  all  levels. 

6.  Are  currently  utilized  "Nonavailability  factors"  adequate? 
No. 


7.  How  can  such  workload  be  incorporated  into  the  manpower  requirement 
or  authorization  process? 

a.  These  activities  better  describe  the  mission  requirements  and  as 
such  car  be  incorporated  into  manpower  staffing  standards. 

b.  The  comparison  of  requirements  to  authori zations  can  be  done 
more  real istical ly. 


ASSUMPTIONS/LIMITATIONS 

A.  While  military  readiness  fails  within  UCA  it  is  understood  that 
medical  readiness  usually  falls  within  direct  patient  care. 

B.  Non-patient  care  activities  are  not  homogeneous. 

C.  We  are  addressing  only  human  patient  care  providers. 


RECOMMENDATIONS 

A.  A  standardized  profiling  system  at  the  provider  level  within  all 
services  to  include  a  coordinated,  uniform  implementation  program. 

B.  A  Tri-Service  Task  Force  to  study  the  comparability  of  data. 

C.  Each  service  should  contribute  an  equitable  data  base  to  a  Tri-service 
Performance  Measurement  effort. 

D.  Increase  ancillary  services  (e.g.,  pharmacy,  lab,  preventive  medi¬ 
cine,  occupational  health,  radiology,  dietary,  vet,  etc.)  involve¬ 
ment  in  future  performance  measurement  activities. 


STATUS  OF  SERVICE  PRO JECTS/AC T I V  IT  I ES 

A.  Monthly  reporting  system  on  personnel  utilization  in  contingency 
operations  (Navy). 

B.  APJRS/MSjS,  UCAPERS,  Performance  Measurement  Study  (Army). 

C.  PRISM  -  Producti vity/eff iciency  model  using  weighted  units  (Air 
r  or  op  ) . 


362 


GROUP  D:  AMBULATORY  CARE  DATA  CAPTURE 

1.  What  specific  data  elements  should  be  included  in  an  Ambulatory  Care 
Data  Base  ( ACD8 ) ? 

a.  Demographics. 

b.  Diagnoses. 

c.  Procedures. 

d.  Examinations. 

e.  Time  spent  (patient  with  provider). 

f.  Who  sees  patient  (provider  registration) 

2.  What  level  of  detail  in  coding  diagnoses  and  procedures  and  cap¬ 
turing  provider  and  resource  use  information  is  required? 

6-Byte  field. 

3.  Is  the  basic  unit  of  observation  for  an  ACDB  the  visit  or  the 
episode? 

Visit.  In  the  future  we  may  be  called  to  look  more  at  episodes  of 
care.  Data  is  not  available  to  do  so  at  this  time. 

4.  What  is  the  best  method  for  data  capture? 

Forms  presented  by  the  Army  as  they  "bulldozed"  this  concept  through 
the  work  group. 

5.  What  is  the  potential  impact  of  proposed  data  capture  methods  on  the 
local  facility? 

a.  Cost. 

b.  Learning  curve. 

c.  Command  emphasis  essential. 

d.  Physician  acceptance. 

e.  Logistics. 

f.  Data  security. 

6.  How  can  an  ACDb  tie  into  UCA  and  TR IMIS  systems? 

By  workoer te'  as  well  as  •  isit.  count.  Patients  tied  in  by  SSN. 
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7.  What  should  be  the  basis  for  developing  a  set  of  Ambulatory  Patient 
Clusters  (APCs)? 

Long-term  experience  with  the  system. 

8.  Must  the  same  coding  system  be  used  for  both  inpatient  and  ambula¬ 
tory  patient  data  collection? 

Preferably  use  the  same  coding  system  for  both  -  continuity. 

9.  Could  an  ACDB  become  the  basis  for  an  automated  quality  assurance 
system? 

A  partial  yes. 
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Conference  Evaluation 

Please  indicate  the  extent  to  which 
you  agree  or  disagree  with  the  fol¬ 
lowing  statements. 

STRONGLY 

DISAGREE 

DISAGREE 

UNDECIDED 

AGREE 

R  . 

r 

r  ’ 
r 

1.  The  advance  notice  I  received  con¬ 
cerning  the  purpose  of  the  conference 
was  sufficient  for  me  to  prepare  myself 
as  a  participant. 

03% 

14% 

03% 

45% 

35% 

< 

' 

2.  The  facilities  (condominiums,  meet¬ 
ing  rooms,  etc.)  were  adequate  and  pro¬ 
vided  an  environment  that  enhanced  the 
exchange  of  information. 

03% 

21% 

76% 

« 

3.  The  food  served  was  good  in  both 
quantity  and  quality. 

03% 

14% 

45% 

38% 

- 

’ 

4.  The  administrative  support  (e.g., 
conference  staff,  break  periods,  trans¬ 
portation,  etc.)  were  adequate. 

03% 

10% 

25% 

62% 

< 

5.  The  recreational  facilities  enabled 
me  to  participate  in  adequate  physical 
activity  during  the  conference. 

07% 

10% 

35% 

48% 

:« 

6.  The  civilian  guest  speakers  pro¬ 
vided  up-to-date  information  which  can 
be  effectively  used  by  developers  of 
performance  measures  in  the  military 
health  care  environment. 

21% 

79% 

j 

7.  The  Service  projects  update  pro¬ 
vided  adequate  information  on  the  sta¬ 
tus  of  performance  measurement  activi¬ 
ties  in  the  Army,  Navy,  and  Air  Force. 

28% 

72% 

4 

3.  'he  military  speakers  provided  in¬ 
formation  which  will  be  useful  to  me 
in  my  future  involvement  with  perfor¬ 
mance  measurement. 

03% 

31% 

66% 

9.  The  group  sessions  were  well- 
structured  and  allowed  a  free  ex¬ 
change  of  information  on  important 
topics.  (Group  #  ) 

07% 

34% 

59% 

4 

10.  The  group  sessions  reports  on 

Friday  morning  provided  sufficient 
information  on  the  accomplishments 
o£  the  groups. 

52% 

24“ 

24% 

4 

(Please 

see  other 

side . ) 

_ _ 
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Conference  Evaluation  (continued) 


Please  indicate  the  extent  to  which 
you  agree  or  disagree  with  the  fol¬ 
lowing  statements. 

STRONGLY 

DISAGREE 

DISAGREE 

UNDECIDED 

AGREE 

STRONGL 

AGREE 

11.  This  conference  should  facilitate 
future  Tri-Service  cooperation  leading 
toward  unified  approaches  to  perform¬ 
ance  measurement. 

07% 

10% 

83% 

12.  My  purposes  for  attending  this 
conference  were  fulfilled. 

34% 

66% 

13.  The  thing  I  liked  best  about  this  conference  was:  amount  and  quality  of  information; 


civilian  presentations;  excellent  facilities;  format  and  selection  of  attendees;  level  of 
exchange;  meeting  other  service  members;  military  stance;  on  schedule;  tri-service  inter¬ 
action:  well  organized;  work  groups;  POD  and  interservice  activities  update. _ 


14.  The  thing  I  liked  least  about  this  conference  was:  evening  sessions:  excessive  atten¬ 
tion  to  statistical  processes;  food;  gravy;  inability  to  attend  all  sessions;  lack  of  free 
f- imp ;  lack  of  handouts  for  presentations;  lack  of  preparation  time;  lack  of  mixed  room  as¬ 
signments  (need  to  integrate  services);  lack  of  racouetball  facilities;  long  presentations 
without  breaks;  tight  schedule  on  Thursday;  lack  of  gathering  places  to  sit  around  and  tali 

15.  Additional  comments:  "best  conference  I  ever  attended";  "need  more  balance  among  the 

services  in  presentations" :  "need  more  advance  notice  for  preparation  to  attend";  "civil iar 
guest  speakers  were  great";  "would  have  enjoyed  less  formal  structure  and  more  time  to 
interact  with  representati ves  from  other  services". _ 


SERVICE  IDENTIFICATION  (Optional)  .  Circle  One 

Military:  ARMY  NAVY  AIR  FORCE 

Civilian  Emp:  ARMY  NAVY  AIR  FORCE 
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